Abstract

This paper investigates the benefits of Amplify-and-Forward (AF) relaying in the setup of multi-antenna wireless networks. 
Reference [1] introduced the idea of Random Sequential (RS) relaying and showed that it achieves the maximum diversity gain 
in a general multi-antenna network. Here, we show that random unitary matrix multiplication at the relay nodes empowers the 
RS scheme to achieve a better Diversity-Multiplexing Tradeoff (DMT) as compared to the traditional AF relaying. First, we study 
the case of a multi-antenna full-duplex single-relay two-hop network, for which we show that the RS achieves the optimum 
DMT. Applying this result, we derive a new achievable DMT for the case of multi-antenna half-duplex parallel relay network. 
Interestingly, it turns out that the DMT of the RS scheme is optimum for the case of multi-antenna two parallel non-interfering 
half-duplex relays. Next, we show that random unitary matrix multiplication also improves the DMT of the Non-Orthogonal AF
relaying scheme of [2] in the case of a multi-antenna single relay channel. Finally, we study the general case of multi-antenna
full-duplex relay networks and derive a new lower-bound on their DMT using the RS scheme.



I. Introduction 

Recently, cooperative schemes have been proposed as candidates to exploit the spatial diversity offered by the relay networks 
(see for example [2], [5]-[9] and references therein). Decode-and-Forward (DF), Amplify-and-Forward (AF) and Compress- 
and-Forward (CF) are the primary relaying strategies discussed in the literature. While DF and CF strategies are popular 
for small-scale networks to achieve or approach capacity values (see for example [10]-[12] and references therein), the AF 
relaying turns out to be more suitable in realizing cooperative transmission strategies (see for example [1], [2], [5], [8], [13] 
and references therein).

In AF relaying, the relays do not decode the transmitted messages. Instead, they forward their received signals to the
destination after a proper amplification. Hence, the relays require less computing power and the end-to-end system experiences 
a smaller delay in comparison with the other relaying strategies. Accordingly, AF relaying schemes are suitable for deployment 
in practical wireless systems. Moreover, in contrast to DF relaying, the performance of AF relaying is not limited by the
source-to-relay channel quality. Also, unlike CF relaying, parallel relays can be used in conjunction with the AF strategy to realize
the power boosting offered by coherently combining the received signals at the destination [9]. Indeed, it is shown that AF 
relaying is asymptotically (large number of relays) optimal (capacity achieving) for both single-antenna [14] and multi-antenna 
[9], [15] parallel relay networks. 

While AF relaying has been investigated well in single-antenna networks, much less is known about its application in multi-antenna
networks. Indeed, unlike the single-antenna scenario, in this case the AF multipliers are matrices rather than scalars. Hence,
finding the optimum AF matrices becomes challenging. 

A fundamental measure to evaluate the performance of cooperative schemes is the DMT, which was first introduced by 
Zheng and Tse in the context of point-to-point multi-antenna fading channels [16]. Roughly speaking, the diversity-multiplexing 
tradeoff identifies the optimal compromise between the transmission reliability and the data rate in the high Signal to Noise 
Ratio (SNR) regime. 
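For reference, the two gains can be stated formally. The block below restates the standard definitions of [16] in the notation of this paper, with P playing the role of the SNR; the symbols R(P) and P_e(P), for the rate and the error probability of the code family, are labels introduced here only for illustration.

```latex
% A family of codes with rate R(P) and error probability P_e(P) achieves
% multiplexing gain r and diversity gain d if
\[
  r = \lim_{P \to \infty} \frac{R(P)}{\log P},
  \qquad
  d = -\lim_{P \to \infty} \frac{\log P_e(P)}{\log P}.
\]
% For an m x n point-to-point Rayleigh fading channel with a sufficiently long
% block length, the optimal tradeoff d^*(r) is the piecewise-linear function
% connecting the points
\[
  \bigl(k,\ (m-k)(n-k)\bigr), \qquad k = 0, 1, \ldots, \min(m, n).
\]
```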

The Non-orthogonal Amplify-and-Forward (NAF) scheme, first proposed by Nabar et al. in [17], has been further studied by 
Azarian et al. in [2] for the single-antenna multi-relay setup. In addition to analyzing the DMT of the NAF scheme, reference 
[2] shows that NAF is the best in the class of AF strategies for single-antenna single-relay systems. However, the NAF scheme 
falls far from the upper-bound in multi-relay systems. 

Yang and Belfiore in [13] study the DMT performance of the NAF scheme for the multi-antenna parallel relay setup. 
Moreover, based on the non-vanishing determinant criterion, the authors constructed a family of space-time codes for the NAF 
scheme over multi-antenna channels. However, as shown in [13], the NAF scheme falls far from the DMT upper-bound in the 
multiple-antenna setup, particularly for small values of multiplexing gain. Indeed, even for the case of a multi-antenna two-hop 
single-relay setup, the NAF scheme is unable to achieve the maximum diversity gain of the system. 

Recently, [1] has studied the achievable DMT of the AF relaying for a general multi-relay network. For this purpose, 
they propose a new AF relaying scheme which is called RS. They show that the RS scheme achieves the optimum diversity 
gain in a general multi-antenna multi-relay network. However, they established the DMT optimality results only for the single- 
antenna multi-relay network. Also, the authors in [18] independently obtained the same DMT optimality result using a different 
approach. 

Yuksel et al. in [6] apply CF strategy and show that CF achieves the DMT upper-bound for multi-antenna half-duplex 
single-relay networks. However, in their proposed scheme, the relay node needs to know the Channel State Information (CSI) 
of all the channels in the network, which may not be practical. More recently, Avestimehr et al. in [12] show that a variant 
of the CF relaying achieves the capacity of any general single-antenna Gaussian relay network within a fixed number of bits, 
which only depends on the number of nodes in the network. Furthermore, the authors in [19] show that the result of [12] is
still valid for multi-antenna Gaussian relay networks, both with and without fast (ergodic) Rayleigh fading.

In this paper, we investigate the benefits of AF relaying in multi-antenna multi-relay networks. For this purpose, we study 
the application of the RS scheme proposed in [1]. The key elements of the proposed scheme are: 1) signal transmission through
sequential paths in the network, 2) path timing such that no non-causal interference is caused from the transmitted signal of
the future paths on the received signal of the current path, 3) multiplication by a random unitary matrix at each relay node, and
4) no signal boosting in AF relaying at the relay nodes, i.e., the received signal is amplified by a coefficient with an absolute
value of at most 1. We derive the DMT of the RS scheme for multi-antenna multi-relay networks. To accomplish this task,
we first study a simple structure, namely the multi-antenna full-duplex two-hop single-relay network. We show that, unlike the
traditional AF relaying, the RS scheme achieves the optimum DMT. This fact can be justified as follows: using the traditional
AF relaying, there exists a chance that the eigenvectors corresponding to the largest eigenvalues of the incoming channel
matrix of the relay project onto the eigenvectors corresponding to the small eigenvalues of the relay's outgoing channel matrix.
This event degrades the performance of traditional AF relaying in the multi-antenna setup. However, in the RS scheme, due
to the random independent unitary matrix multiplication at the relay nodes for different time-slots, such an event is much less
likely to happen. This fact will be elaborated throughout the paper.
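The random unitary multiplication at the relay can be sketched numerically. The following is a minimal sketch, not the authors' implementation: it assumes NumPy and draws an isotropic (Haar-distributed) unitary matrix through the QR decomposition of a complex Gaussian matrix, which is one standard construction. The function name `haar_unitary`, the seed, and the sizes are hypothetical choices made for illustration.

```python
import numpy as np

def haar_unitary(p, rng):
    """Draw a p x p isotropic (Haar-distributed) unitary matrix."""
    # i.i.d. circularly symmetric complex Gaussian entries, CN(0, 1)
    z = (rng.standard_normal((p, p)) + 1j * rng.standard_normal((p, p))) / np.sqrt(2)
    q, r = np.linalg.qr(z)
    # Normalize the phases of the diagonal of R; without this step the
    # QR output is unitary but not exactly Haar distributed.
    phases = np.diag(r) / np.abs(np.diag(r))
    return q * phases  # scales column j of q by phases[j]

rng = np.random.default_rng(0)
# One independent unitary matrix per transmission slot
# (assumed here: 4 slots, relay with p = 3 antennas).
thetas = [haar_unitary(3, rng) for _ in range(4)]
```

Because each matrix is unitary, multiplying the relay's received vector by it leaves the signal power unchanged, which is why the relay power constraint is unaffected.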

Next, we study the case of multi-antenna half-duplex parallel relay network and, by deriving its DMT, we show that the RS 
scheme improves the DMT of the traditional AF relaying scheme. Interestingly, it turns out that the DMT of the RS scheme 
is optimum for the multi-antenna half-duplex parallel two-relay (K = 2) setup with no direct link between the relays. We also
show that utilizing random unitary matrix multiplication improves the DMT of the NAF relaying scheme of [2] in the case of
a multi-antenna single relay channel. 

Finally, we study the class of general full-duplex multi-antenna relay networks whose underlying graph is directed acyclic 
and all nodes are equipped with the same number of antennas. Using the RS scheme, we derive a new lower-bound for the 
achievable DMT of this class of networks. It turns out that the new DMT lower-bound meets the optimum DMT at the corner
points, corresponding to the maximum multiplexing gain and the maximum diversity gain of the network, respectively. Another 
point worth mentioning is that the RS scheme is robust in the sense that it achieves all points of the DMT curve with no 
modification of the underlying parameters. In other words, the relay nodes of the network perform the same operation, no 
matter at which point of the DMT curve the scheme is operating. 

The rest of the paper is organized as follows. The system model is introduced in Section II. Section III is dedicated to the
Diversity-Multiplexing Tradeoff analysis of the RS scheme in the multi-antenna setup. This section is further divided into four subsections
as follows. Subsection III-A studies the multi-antenna single-relay two-hop network and also the multi-antenna multi-hop relay
network with one relay in each hop. Subsection III-B is dedicated to the multi-antenna half-duplex parallel relay network. 
The well-known multi-antenna single-relay channel (with direct link between the source and the destination) is investigated 
in subsection III-C. Subsection III-D studies the achievable DMT of the RS scheme for the general multi-antenna full-duplex 
relay networks whose underlying graph is directed acyclic. Finally, Section IV concludes the paper.

A. Notations

Throughout the paper, the superscripts $^T$ and $^H$ stand for matrix operations of transposition and conjugate transposition,
respectively. Capital bold letters represent matrices, while lowercase bold letters and regular letters represent vectors and
scalars, respectively. $\|\mathbf{v}\|$ denotes the norm of vector $\mathbf{v}$, while $\|\mathbf{A}\|$ represents the Frobenius norm of matrix $\mathbf{A}$. $|\mathbf{A}|$ denotes the
determinant of matrix $\mathbf{A}$ and $\mathrm{Tr}\{\mathbf{A}\}$ denotes the trace of matrix $\mathbf{A}$. $\log(\cdot)$ denotes the base-2 logarithm. The notation $\mathbf{A} \preceq \mathbf{B}$ is
equivalent to $\mathbf{B} - \mathbf{A}$ being a positive semi-definite matrix. Motivated by the definition in [16], we define the notation $f(P) \doteq g(P)$
as $\lim_{P\to\infty} \frac{\log f(P)}{\log P} = \lim_{P\to\infty} \frac{\log g(P)}{\log P}$. Similarly, $f(P) \,\dot{\le}\, g(P)$ and $f(P) \,\dot{\ge}\, g(P)$ are equivalent to
$\lim_{P\to\infty} \frac{\log f(P)}{\log P} \le \lim_{P\to\infty} \frac{\log g(P)}{\log P}$ and $\lim_{P\to\infty} \frac{\log f(P)}{\log P} \ge \lim_{P\to\infty} \frac{\log g(P)}{\log P}$, respectively. Finally, we use $A \approx B$ to denote the approximate
equality between $A$ and $B$, such that by substituting $A$ with $B$ the validity of the equations is not compromised.
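As a short illustration of the exponential-equality notation, the following worked examples (constructed here for illustration, not taken from the paper) show that constant factors, lower-order terms and logarithmic factors do not affect the exponent:

```latex
\[
  f(P) = 3P^{-2} + P^{-5}
  \;\Longrightarrow\;
  \lim_{P\to\infty}\frac{\log f(P)}{\log P} = -2
  \;\Longrightarrow\;
  f(P) \doteq P^{-2},
\]
\[
  g(P) = P^{-2}\log P
  \;\Longrightarrow\;
  \lim_{P\to\infty}\frac{\log g(P)}{\log P}
  = -2 + \lim_{P\to\infty}\frac{\log\log P}{\log P} = -2
  \;\Longrightarrow\;
  g(P) \doteq P^{-2}.
\]
```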

II. System Model 

In this paper, we study the DMT for various setups of multiple-antenna relay networks. The common assumptions for all 
setups are: 

• The network consists of a single source, one or multiple relays and a single destination 

• All nodes in the network are assumed to be equipped with multiple antennas. However, the number of antennas at each
node is denoted differently depending on the scenario being discussed. In the setup of Theorem 1 in subsection III-A,
which studies the DMT of a two-hop single-relay network, the numbers of antennas at the source, relay and destination are
denoted by $m$, $p$, and $n$, respectively. However, in Theorem 2 of the same subsection, which considers the setup of a multi-hop relay
network, the numbers of antennas at the nodes are denoted by $N_0, N_1, \ldots, N_h$, where $h$ denotes the total number of hops.
In subsection III-B, in which the setup of a parallel relay network is considered, the numbers of antennas at the source
and the destination are assumed to be $m$ and $n$. In Theorem 3, $K$ parallel relays with the same number (denoted by $p$)
of antennas are considered, while Theorem 4 studies the setup of two parallel relays with different numbers of antennas,
denoted by $n_1$ and $n_2$, while assuming $n = m$. The general setup of a single-relay network (with direct link between the
source and the destination) is studied in subsection III-C (Theorem 5), and the numbers of antennas at the source, relay
and destination are denoted by $m$, $p$, and $n$, respectively. Finally, the general setup of a multi-antenna full-duplex relay
network with directed acyclic underlying graph is considered in subsection III-D (Theorem 6), in which the number of
antennas at node $i$ is denoted by $N_i$.

• All channels in the network are assumed to be circularly symmetric zero-mean unit-variance Gaussian with gains that 
remain unchanged during the entire transmission period (quasi-static Rayleigh fading). For each link, the receiver side
is assumed to have perfect knowledge about the corresponding channel and the transmitter side is assumed to have no 
knowledge about the channel. 

• The noise vector at each node (relay or destination) is assumed to be additive white with circularly symmetric zero-mean 
unit-variance Gaussian distribution. 

• The relays are assumed to work in full-duplex mode in subsections III-A and III-D and half-duplex mode in subsections
III-B and III-C.

• The source and the relay(s) have power constraints $\mathbb{E}\{\mathbf{x}_t^H\mathbf{x}_t\} \le P$ and $\mathbb{E}\{\mathbf{x}_r^H\mathbf{x}_r\} \le P$, respectively, where the average
is taken with respect to $\mathbf{x}_t$ for the source and with respect to $\mathbf{x}_t$ and $\mathbf{n}_r$ (which together determine the received signal at the relay) for the relay^1.
As the main focus of this paper is the analysis of DMT, the source and the relay(s) are assumed to operate in the high
SNR regime ($P \to \infty$).



III. Diversity-Multiplexing Tradeoff 

A. Two-Hop Single Relay Network 

This setup corresponds to the network consisting of a source, a single full-duplex relay and a destination with no direct link 
between the source and the destination. The source, relay, and destination are equipped with m, p, and n antennas, respectively 
(see Figure 1). The channel between the source and the relay is denoted by $\mathbf{H}$ and the channel between the relay and the
destination is denoted by $\mathbf{G}$. First, we study the DMT achievable by the traditional AF scheme in Lemmas 1 and 2 and show that the
achievable DMT in general does not match the optimal value, which is achievable by the DF scheme^2. Then, in Theorem
1, we prove that using the proposed RS scheme, which is a modification of the traditional AF scheme, the optimal DMT is
indeed achievable. Theorem 2 generalizes the result of Theorem 1 to the case of a multi-hop relay network and shows that the
proposed RS scheme still achieves the optimum DMT provided that a certain relationship between the numbers of antennas at
the nodes is satisfied.

Fig. 1. Schematic of a multi-antenna single-relay two-hop network: a source (Tx) with m antennas, the first-hop channel H, a relay with p receive/transmit antennas, the second-hop channel G, and a destination with n antennas.

In the traditional AF strategy, the received signal at the relay is multiplied by a constant $\alpha$ such that the power constraint
at the relay is satisfied, and it is then transmitted to the destination. The corresponding received signal at the destination can
be written as

$$\mathbf{y} = \alpha\mathbf{G}\mathbf{H}\mathbf{x}_t + \alpha\mathbf{G}\mathbf{n}_r + \mathbf{n}_d, \tag{1}$$

where $\mathbf{x}_t$ denotes the transmitted signal from the source and $\mathbf{n}_r \sim \mathcal{CN}(\mathbf{0}, \mathbf{I}_p)$ and $\mathbf{n}_d \sim \mathcal{CN}(\mathbf{0}, \mathbf{I}_n)$ denote the noise vectors
at the relay and at the destination, respectively.
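The signal model in (1) can be simulated directly. The sketch below is illustrative only: it assumes NumPy, and the helper names `complex_gaussian` and `af_two_hop`, the antenna counts, and the value of the gain are hypothetical choices, not taken from the paper.

```python
import numpy as np

def complex_gaussian(shape, rng):
    """i.i.d. circularly symmetric complex Gaussian samples, CN(0, 1)."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)

def af_two_hop(G, H, x_t, alpha, rng):
    """One channel use of traditional AF relaying, following (1)."""
    n, p = G.shape
    n_r = complex_gaussian(p, rng)        # noise at the relay
    n_d = complex_gaussian(n, rng)        # noise at the destination
    relay_tx = alpha * (H @ x_t + n_r)    # relay scales what it received
    y = G @ relay_tx + n_d
    return y, n_r, n_d

rng = np.random.default_rng(1)
m, p, n = 2, 3, 2                         # illustrative antenna counts
H = complex_gaussian((p, m), rng)         # source-to-relay channel
G = complex_gaussian((n, p), rng)         # relay-to-destination channel
x_t = complex_gaussian(m, rng)            # one transmitted vector
y, n_r, n_d = af_two_hop(G, H, x_t, alpha=0.5, rng=rng)
```

Note that the relay noise reaches the destination through G, so the effective noise at the destination is colored; Lemmas 1 and 2 are what allow this term to be ignored in the DMT analysis.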



^1 Note that as the channels are assumed to be fixed during the whole transmission period, the expectations are not taken with respect to any of the channel matrices.

^2 In fact, this configuration is a special case of the degraded relay channel studied in [11]. In [11], the authors show that the DF scheme achieves the
capacity of the degraded relay channel.



Lemma 1 The DMT of the system given in (1) is upper-bounded by the DMT of the following system:

$$\mathbf{y} = \alpha\mathbf{G}\mathbf{H}\mathbf{x}_t + \mathbf{n}_d, \tag{2}$$

and is lower-bounded by the DMT of the following system:

$$\mathbf{y} = \alpha\mathbf{G}\mathbf{H}\mathbf{x}_t + \sqrt{c\log(P) + 1}\,\mathbf{n}_d, \tag{3}$$

for some constant $c$.

Proof: See Appendix I. ■

Lemma 2 The DMTs of the systems in (2) and (3) are equal.

Proof: See Appendix II. ■
A direct conclusion of Lemmas 1 and 2 is that the DMT of the two-hop network can be expressed as the DMT of the
product channel $\mathbf{G}\mathbf{H}$, which is computed in [13]. Due to the result given in Proposition 1 in [13], assuming $m, n \ge p$, the
DMT of the product channel $\mathbf{A} = \mathbf{G}\mathbf{H}$ is a piecewise-linear function connecting the points $(r, d_{\mathbf{A}}(r))$, $r = 0, 1, \ldots, p$, where

$$d_{\mathbf{A}}(r) = (p-r)(q-r) - \frac{1}{2}\left\lfloor \frac{\left[(p-\Delta-r)^+\right]^2}{2} \right\rfloor, \tag{4}$$

$q = \min(m, n)$ and $\Delta = |m - n|$. On the other hand, the piecewise-linear function connecting the integer points $(r, (p-r)(q-r))$
can be easily derived as the upper-bound by considering each of the source-relay and relay-destination cuts. Comparing
(4) with the upper-bound, it follows that the traditional AF scheme achieves the optimum DMT only when $r \ge p - \Delta$. This
motivates us to use a variant of the AF scheme which achieves the optimum DMT in all cases. In fact, using the traditional
AF scheme, there are three sources of outage: (i) the outage in the source-relay link, (ii) the outage in the relay-destination
link, and (iii) the event that the projection of the eigenmodes of $\mathbf{H}$ over the eigenmodes of $\mathbf{G}$ is very small. More precisely, the matrix
$\mathbf{V}^H(\mathbf{G})\mathbf{U}(\mathbf{H})$, in which $\mathbf{V}(\mathbf{G})$ denotes the right eigenvector matrix from the Singular Value Decomposition (SVD) of $\mathbf{G}$ and
$\mathbf{U}(\mathbf{H})$ denotes the left eigenvector matrix from the SVD of $\mathbf{H}$, has very small eigenvalues. The extra term $\frac{1}{2}\left\lfloor \frac{[(p-\Delta-r)^+]^2}{2} \right\rfloor$ in
(4) is due to the third source of outage. The first two outage events depend on the distribution of the eigenvalues of $\mathbf{H}$ and $\mathbf{G}$,
while the third event depends solely on the direction of the eigenvectors of these two matrices. This suggests that in order to
eliminate the extra term $\frac{1}{2}\left\lfloor \frac{[(p-\Delta-r)^+]^2}{2} \right\rfloor$, one can multiply the received signal at the relay by $\alpha\mathbf{\Theta}$, for some $p \times p$ unitary
matrix $\mathbf{\Theta}$ (for preserving the power constraint at the relay). However, it should be noted that if $\mathbf{\Theta}$ does not change across the
transmission block, the performance of the system does not change. Therefore, we propose that in each transmission slot an
independent unitary matrix $\mathbf{\Theta}_l$ is used and at the destination side the decoding is performed across $L$ transmission slots^3. This
is exactly what is being done in the proposed RS scheme in [1]. Indeed, this setup is a simple example of the general setup of
the relay network studied in [1] in which the source and the destination are connected through a single path. In this case the
proposed RS scheme reduces to the following: the source's message is sent using $L$ slots through the same path; at the relay
side the received signal is multiplied by a random unitary matrix, drawn independently across slots, and subsequently it
is multiplied by a scalar $\alpha \le 1$^4 such that the power constraint is satisfied, and the result is transmitted in the next slot. At the
destination, after receiving the signals of slots $2, 3, \ldots, L + 1$, the source message is decoded. In the following theorem,
we show that as long as $L$ is above a certain threshold, the probability of the third outage event is negligible compared to the
first two outage events and hence, the optimum DMT is achievable by the RS scheme.

^3 From a practical point of view, the transmission slots are assumed to be long enough to make the probability of error solely dominated by the outage event.
This fact is elaborated further in [16].



Theorem 1 Consider a two-hop network consisting of a source with $m$ antennas and a destination with $n$ antennas that are
connected through a full-duplex relay with $p$ antennas. Let us define $q = \min(m, n)$. Provided $L$ is large enough such that
$L \ge \min^2(p,q)\max(p,q)$, the RS scheme achieves the optimum DMT, which is the piecewise-linear function connecting the
points $(k, (p-k)(q-k))$, $k = 0, 1, \ldots, \min(p, q)$.
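The statement of Theorem 1 can be compared numerically with the traditional AF tradeoff. The sketch below is a plain-Python illustration under stated assumptions: the function names are hypothetical, the slot threshold is taken to be min^2(p,q) max(p,q), and the traditional AF corner points are assumed to follow the product-channel expression of [13], d_A(r) = (p-r)(q-r) - (1/2) floor([(p-Delta-r)^+]^2 / 2), for m, n >= p.

```python
import math

def dmt_optimal(r, p, q):
    """Optimal DMT of Theorem 1: linear interpolation of (k, (p-k)(q-k))."""
    k_max = min(p, q)
    if r >= k_max:
        return 0.0
    k = math.floor(r)
    d_lo = (p - k) * (q - k)
    d_hi = (p - k - 1) * (q - k - 1)
    return d_lo + (r - k) * (d_hi - d_lo)

def dmt_traditional_af_corner(r, m, p, n):
    """Assumed corner point d_A(r) of traditional AF for integer r and m, n >= p."""
    q, delta = min(m, n), abs(m - n)
    excess = max(p - delta - r, 0)          # (p - Delta - r)^+
    return (p - r) * (q - r) - 0.5 * math.floor(excess ** 2 / 2)

def min_slots(p, q):
    """Number of slots L assumed sufficient in Theorem 1."""
    return min(p, q) ** 2 * max(p, q)

# Example: m = n = p = 2, so q = 2 and Delta = 0.
print(dmt_optimal(0, 2, 2))                   # maximum diversity of the RS scheme
print(dmt_traditional_af_corner(0, 2, 2, 2))  # maximum diversity of traditional AF
print(min_slots(2, 2))                        # slots needed by the RS scheme
```

In this example the two schemes differ at r = 0 and coincide at r = 1, which matches the remark that the loss of traditional AF is concentrated at small multiplexing gains.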

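Proof: Using Lemmas 1 and 2, the DMT of the system using the proposed RS scheme is equal to the DMT of the
following system:

$$\mathbf{Y} = \alpha\mathbf{\Omega}\mathbf{X}_t + \mathbf{N}_d, \tag{5}$$

where $\mathbf{X}_t \triangleq [\mathbf{x}_t^T(1), \cdots, \mathbf{x}_t^T(L)]^T$, $\mathbf{Y} \triangleq [\mathbf{y}^T(1), \cdots, \mathbf{y}^T(L)]^T$, and $\mathbf{N}_d \triangleq [\mathbf{n}_d^T(1), \cdots, \mathbf{n}_d^T(L)]^T$, in which $\mathbf{x}_t(l)$, $\mathbf{y}(l)$ and $\mathbf{n}_d(l)$
denote the transmitted signal, received signal and noise in the $l$'th slot, respectively, and

$$\mathbf{\Omega} = \begin{bmatrix} \mathbf{A}_1 & & & \\ & \mathbf{A}_2 & & \\ & & \ddots & \\ & & & \mathbf{A}_L \end{bmatrix}, \tag{6}$$

in which $\mathbf{A}_l = \mathbf{G}\mathbf{\Theta}_l\mathbf{H}$. Hence, the matrix of the end-to-end channel is a block diagonal matrix consisting of the $\mathbf{A}_l$'s. Assuming
that the transmitted signals in each slot are independent of each other, the mutual information between the input and the output
of (5) can be written as

$$I(\mathbf{X}_t; \mathbf{Y}) = \sum_{l=1}^{L} \log\left|\mathbf{I} + \alpha^2\frac{P}{m}\mathbf{A}_l\mathbf{A}_l^H\right|, \tag{7}$$

in which it is assumed that $\mathbf{x}_t(l) \sim \mathcal{CN}(\mathbf{0}, \frac{P}{m}\mathbf{I}_m)$, $l = 1, \cdots, L$. Using the above equation, the probability of outage can be
written as

$$\mathbb{P}\{\mathcal{O}\} = \mathbb{P}\left\{\sum_{l=1}^{L} \log\left|\mathbf{I} + \alpha^2\frac{P}{m}\mathbf{A}_l\mathbf{A}_l^H\right| < Lr\log(P)\right\}, \tag{8}$$

or equivalently,

$$\mathbb{P}\{\mathcal{O}\} = \mathbb{P}\left\{\sum_{l=1}^{L}\sum_{j=1}^{\min(p,q)} \log\left[1 + \alpha^2\frac{P}{m}\lambda_j(\mathbf{A}_l)\right] < Lr\log(P)\right\}, \tag{9}$$

where $\lambda_i(\mathbf{A})$ denotes the $i$'th ordered eigenvalue of $\mathbf{A}^H\mathbf{A}$ ($\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_{\min}$). Defining $\gamma_j(\mathbf{B}) \triangleq -\frac{\log\lambda_j(\mathbf{B})}{\log P}$ and
$\delta \triangleq -\frac{\log\alpha^2}{\log P}$, we have

$$\begin{aligned}
\mathbb{P}\{\mathcal{O}\} &= \mathbb{P}\left\{\sum_{l=1}^{L}\sum_{j=1}^{\min(p,q)} \log\left(1 + \frac{1}{m}P^{1-\delta-\gamma_j(\mathbf{A}_l)}\right) < Lr\log(P)\right\} \\
&\doteq \mathbb{P}\left\{\sum_{l=1}^{L}\sum_{j=1}^{\min(p,q)} \left(1 - \delta - \gamma_j(\mathbf{A}_l)\right)^+ < Lr\right\}.
\end{aligned} \tag{10}$$

^4 Note that, as can be observed from the proof of Lemma 1 in Appendix I, the constraint $\alpha \le 1$ does not affect the DMT of the system.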
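The mutual information over L slots and the corresponding outage test can be evaluated for a given channel realization. This is a minimal sketch assuming NumPy; the helper names, the seed, the antenna counts, the value of P and the target multiplexing gain are all hypothetical. It computes the determinant form of the mutual information and the equivalent eigenvalue form, which should agree.

```python
import numpy as np

def complex_gaussian(shape, rng):
    """i.i.d. CN(0, 1) samples."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)

def haar_unitary(p, rng):
    """Isotropic unitary matrix via QR of a complex Gaussian matrix."""
    q, r = np.linalg.qr(complex_gaussian((p, p), rng))
    return q * (np.diag(r) / np.abs(np.diag(r)))

def rs_mutual_information(G, H, thetas, P, alpha=1.0):
    """Sum over slots of log2 det(I + alpha^2 (P/m) A_l A_l^H), A_l = G Theta_l H."""
    n, m = G.shape[0], H.shape[1]
    total = 0.0
    for theta in thetas:
        A = G @ theta @ H
        M = np.eye(n) + (alpha ** 2) * (P / m) * (A @ A.conj().T)
        _, logdet = np.linalg.slogdet(M)      # natural log of |det M|
        total += logdet / np.log(2)
    return total

def rs_mutual_information_eig(G, H, thetas, P, alpha=1.0):
    """Same quantity written through the eigenvalues lambda_j(A_l)."""
    m = H.shape[1]
    total = 0.0
    for theta in thetas:
        s = np.linalg.svd(G @ theta @ H, compute_uv=False)  # s**2 = eigenvalues of A^H A
        total += np.sum(np.log2(1.0 + (alpha ** 2) * (P / m) * s ** 2))
    return total

rng = np.random.default_rng(2)
m, p, n, L, P, r = 2, 2, 2, 8, 1e4, 0.5      # illustrative parameters
H = complex_gaussian((p, m), rng)
G = complex_gaussian((n, p), rng)
thetas = [haar_unitary(p, rng) for _ in range(L)]
rate_det = rs_mutual_information(G, H, thetas, P)
rate_eig = rs_mutual_information_eig(G, H, thetas, P)
in_outage = rate_det < L * r * np.log2(P)    # outage event for this realization
```

Averaging `in_outage` over many channel realizations and several values of P would give an empirical outage curve whose slope on a log-log scale approximates the diversity gain.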

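First, we show that $\alpha = 1$ (or $\delta = 0$), with probability one^5. For this purpose, we write as follows:

$$\begin{aligned}
\alpha &= \min\left\{1, \sqrt{\frac{P}{\mathbb{E}_{\mathbf{x}_t,\mathbf{n}_r}\left\{\|\mathbf{H}\mathbf{x}_t + \mathbf{n}_r\|^2\right\}}}\right\} \\
&= \min\left\{1, \sqrt{\frac{P}{\mathrm{Tr}\left\{\mathbf{H}\mathbf{Q}_{\mathbf{x}_t}\mathbf{H}^H + \mathbf{I}\right\}}}\right\} \\
&= \min\left\{1, \sqrt{\frac{P}{\mathrm{Tr}\left\{\frac{P}{m}\mathbf{H}\mathbf{H}^H + \mathbf{I}\right\}}}\right\} \\
&= \min\left\{1, \sqrt{\frac{P}{\frac{P}{m}\|\mathbf{H}\|^2 + p}}\right\}.
\end{aligned} \tag{11}$$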
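The behavior of the relay gain at high SNR can be checked with a few numbers. The sketch below is plain Python with hypothetical function names; it assumes the gain has the form alpha = min{1, sqrt(P / ((P/m)||H||^2 + p))} and that delta is defined through alpha^2 = P^(-delta), both of which are reconstructions rather than confirmed expressions.

```python
import math

def relay_gain(h_fro_sq, m, p, P):
    """Assumed gain: min{1, sqrt(P / ((P/m) ||H||^2 + p))}."""
    received_power = (P / m) * h_fro_sq + p   # Tr{(P/m) H H^H + I_p}
    return min(1.0, math.sqrt(P / received_power))

def gain_exponent(alpha, P):
    """delta = -log(alpha^2) / log(P), so that alpha^2 = P^(-delta)."""
    return -math.log(alpha ** 2) / math.log(P)

# Typical realization: ||H||^2 close to its mean m * p.
m, p = 2, 2
for P in (1e3, 1e6, 1e9):
    alpha = relay_gain(m * p, m, p, P)
    print(P, alpha, gain_exponent(alpha, P))
```

For a typical channel the gain stays bounded away from zero, so the exponent delta shrinks as P grows; this is the sense in which delta can be replaced by zero in the DMT analysis.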

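From the above equation, we have

$$\mathbb{P}\{\delta > \delta_0\} \le \mathbb{P}\left\{\|\mathbf{H}\|^2 > P^{\delta_0 - \epsilon}\right\}, \tag{12}$$

for all $\delta_0, \epsilon > 0$. Noting that $\|\mathbf{H}\|^2$ has a Chi-square distribution with $2mp$ degrees of freedom, it follows that

$$\mathbb{P}\left\{\|\mathbf{H}\|^2 > P^{\delta_0 - \epsilon}\right\} \approx \frac{P^{mp(\delta_0 - \epsilon)}}{(mp)!}\exp\left\{-P^{\delta_0 - \epsilon}\right\}. \tag{13}$$

Choosing $\epsilon < \delta_0$, the above equation implies that for $\delta_0 > 0$, the probability $\mathbb{P}\{\delta > \delta_0\}$ approaches zero much faster than
polynomially. More precisely, defining the event $\mathcal{E} = \{\delta > 0\}$, we have $\mathbb{P}\{\mathcal{E}\} = o(P^{-c})$ for any positive constant $c$. Since
$\mathbb{P}\{\mathcal{O}\} \,\dot{\ge}\, P^{-(p-r)(q-r)} \ge P^{-pq}$ (lower-bound on the outage probability corresponding to the DMT upper-bound), we can write

$$\begin{aligned}
\mathbb{P}\{\mathcal{O}\} &= \mathbb{P}\{\mathcal{O}|\mathcal{E}\}\mathbb{P}\{\mathcal{E}\} + \mathbb{P}\{\mathcal{O}|\mathcal{E}^c\}\mathbb{P}\{\mathcal{E}^c\} \\
&\overset{(a)}{\doteq} \mathbb{P}\{\mathcal{O}|\mathcal{E}^c\},
\end{aligned} \tag{14}$$

where (a) follows from the fact that $\mathbb{P}\{\mathcal{E}\} = o(\mathbb{P}\{\mathcal{O}\})$. In other words, one can replace $\delta$ with zero in (10), which results in

$$\mathbb{P}\{\mathcal{O}\} \doteq \mathbb{P}\left\{\sum_{l=1}^{L}\sum_{j=1}^{\min(p,q)} \left(1 - \gamma_j(\mathbf{A}_l)\right)^+ < Lr\right\}. \tag{15}$$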



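Moreover, we have

$$\lambda_j(\mathbf{A}_l) \le \|\mathbf{A}_l\|^2 \overset{(a)}{\le} \|\mathbf{G}\|^2\|\mathbf{H}\|^2, \tag{16}$$

where (a) results from the fact that $\|\mathbf{A}\mathbf{B}\|^2 \le \|\mathbf{A}\|^2\|\mathbf{B}\|^2$ for any two matrices $\mathbf{A}$ and $\mathbf{B}$, together with the invariance of the Frobenius norm under multiplication by the unitary matrix $\mathbf{\Theta}_l$. Consider a negative number $\epsilon$.
From the above equation, it follows that

$$\begin{aligned}
\mathbb{P}\{\gamma_j(\mathbf{A}_l) < \epsilon\} &\le \mathbb{P}\left\{\|\mathbf{G}\|^2\|\mathbf{H}\|^2 > P^{-\epsilon}\right\} \\
&\le \mathbb{P}\left\{\|\mathbf{G}\|^2 > P^{-\frac{\epsilon}{2}}\right\} + \mathbb{P}\left\{\|\mathbf{H}\|^2 > P^{-\frac{\epsilon}{2}}\right\} \\
&\approx \frac{P^{-\frac{np\epsilon}{2}}}{(np)!}\exp\left\{-P^{-\frac{\epsilon}{2}}\right\} + \frac{P^{-\frac{mp\epsilon}{2}}}{(mp)!}\exp\left\{-P^{-\frac{\epsilon}{2}}\right\} \\
&= o\left(P^{-(q-r)(p-r)}\right).
\end{aligned} \tag{17}$$

As a result, following (14), we can assume that $\gamma_j(\mathbf{A}_l) \ge 0$, $\forall j = 1, \cdots, \min(p, q)$, in (15).

^5 Note that due to the definition of $\alpha$, we always have $\delta \ge 0$.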

In order to compute the outage probability in (15), we need to find the statistical behavior of $\gamma_j(\mathbf{A}_l)$. Since we are interested
in upper-bounding the outage probability of the RS scheme, finding an upper-bound for $\gamma_j(\mathbf{A}_l)$, or equivalently, a lower-bound
for $\lambda_j(\mathbf{A}_l)$ would be sufficient. This is performed in the following lemma.

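Lemma 3 Consider matrices $\mathbf{G}$ and $\mathbf{H}$ with the size of $m \times p$ and $p \times n$, respectively, and a $p \times p$ matrix $\mathbf{\Theta}$. Assume $\mathbf{G}$ and
$\mathbf{H}$ are singular value decomposed as $\mathbf{G} = \mathbf{U}(\mathbf{G})\mathbf{\Lambda}^{\frac{1}{2}}(\mathbf{G})\mathbf{V}^H(\mathbf{G})$ and $\mathbf{H} = \mathbf{U}(\mathbf{H})\mathbf{\Lambda}^{\frac{1}{2}}(\mathbf{H})\mathbf{V}^H(\mathbf{H})$, respectively. We have

$$\lambda_i(\mathbf{G}\mathbf{\Theta}\mathbf{H}) \ge \lambda_i(\mathbf{G})\lambda_i(\mathbf{H})\lambda_{\min}\left(\mathbf{V}^H_{(1,i)}(\mathbf{G})\mathbf{\Theta}\mathbf{U}_{(1,i)}(\mathbf{H})\right), \tag{18}$$

where $\lambda_i(\mathbf{A})$ and $\lambda_{\min}(\mathbf{A})$ denote the $i$'th largest eigenvalue and the minimum eigenvalue of $\mathbf{A}$, respectively, and $\mathbf{A}_{(a,b)}$
denotes the submatrix of $\mathbf{A}$ consisting of the $a, a+1, \ldots, b$'th columns of $\mathbf{A}$.

Proof: See Appendix III. ■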
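The inequality of Lemma 3 can be spot-checked numerically. The sketch below assumes NumPy and interprets the eigenvalues of a matrix A in the lemma as the eigenvalues of A^H A (squared singular values), consistent with the definition used in the outage expression; the helper names and matrix sizes are hypothetical. A numerical check on random draws is of course not a proof.

```python
import numpy as np

def complex_gaussian(shape, rng):
    """i.i.d. CN(0, 1) samples."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)

def haar_unitary(p, rng):
    """Isotropic unitary matrix via QR of a complex Gaussian matrix."""
    q, r = np.linalg.qr(complex_gaussian((p, p), rng))
    return q * (np.diag(r) / np.abs(np.diag(r)))

def lemma3_gap(G, theta, H):
    """Left side minus right side of the Lemma 3 inequality, for each index i."""
    _, s_g, v_g_h = np.linalg.svd(G)     # G = U diag(s_g) V^H, v_g_h = V^H
    u_h, s_h, _ = np.linalg.svd(H)       # H = U diag(s_h) V^H
    s_prod = np.linalg.svd(G @ theta @ H, compute_uv=False)
    k = min(len(s_g), len(s_h))
    gaps = []
    for i in range(1, k + 1):
        # i x i matrix V_(1,i)^H(G) Theta U_(1,i)(H)
        psi = v_g_h[:i, :] @ theta @ u_h[:, :i]
        lam_min = np.linalg.svd(psi, compute_uv=False)[-1] ** 2
        lhs = s_prod[i - 1] ** 2
        rhs = (s_g[i - 1] ** 2) * (s_h[i - 1] ** 2) * lam_min
        gaps.append(lhs - rhs)
    return np.array(gaps)

rng = np.random.default_rng(3)
G = complex_gaussian((3, 4), rng)        # illustrative sizes
H = complex_gaussian((4, 2), rng)
theta = haar_unitary(4, rng)
gaps = lemma3_gap(G, theta, H)           # expected to be non-negative
```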
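The above lemma relates $\lambda_i(\mathbf{A}_l)$ to $\lambda_i(\mathbf{G})$ and $\lambda_i(\mathbf{H})$, which facilitates the subsequent derivations. A direct consequence
of the above lemma is that

$$\gamma_j(\mathbf{A}_l) \le \gamma_j(\mathbf{G}) + \gamma_j(\mathbf{H}) + \gamma_{\min}(\mathbf{\Psi}_{l,j}), \tag{19}$$

where $\mathbf{\Psi}_{l,j} \triangleq \mathbf{V}^H_{(1,j)}(\mathbf{G})\mathbf{\Theta}_l\mathbf{U}_{(1,j)}(\mathbf{H})$. As the statistical behaviors of $\gamma_i(\mathbf{G})$ and $\gamma_i(\mathbf{H})$ are known from [16], it is sufficient to
derive the asymptotic behavior of $\gamma_{\min}(\mathbf{\Psi}_{l,j})$, or equivalently, $\lambda_{\min}(\mathbf{\Psi}_{l,j})$, which is performed in the following lemma:

Lemma 4 Assuming small enough $\epsilon$, we have

$$\mathbb{P}\{\lambda_{\min}(\mathbf{\Psi}_{l,j}) \le \epsilon\} \le \eta\epsilon, \tag{20}$$

for some constant $\eta$.

Proof: See Appendix IV. ■

A direct consequence of the above lemma is that

$$\mathbb{P}\{\gamma_{\min}(\mathbf{\Psi}_{l,j}) \ge \psi\} \,\dot{\le}\, P^{-\psi}. \tag{21}$$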



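Defining the $L \times 1$ vector $\boldsymbol{\psi} = [\psi(1), \cdots, \psi(L)]^T$ as $\psi(l) \triangleq \max_j \gamma_{\min}(\mathbf{\Psi}_{l,j})$, we have

$$\begin{aligned}
\mathbb{P}\{\boldsymbol{\psi} \ge \boldsymbol{\psi}_0\} &\overset{(a)}{=} \prod_{l=1}^{L} \mathbb{P}\{\psi(l) \ge \psi_0(l)\} \\
&= \prod_{l=1}^{L} \mathbb{P}\left\{\bigcup_{j=1}^{\min(p,q)} \left(\gamma_{\min}(\mathbf{\Psi}_{l,j}) \ge \psi_0(l)\right)\right\} \\
&\overset{(b)}{\dot{\le}} P^{-\sum_{l=1}^{L}\psi_0(l)}.
\end{aligned} \tag{22}$$

As the $\mathbf{\Theta}_l$'s are independent isotropic unitary matrices, their products with any possibly correlated set of unitary matrices construct
a set of independent isotropic unitary matrices [20]. Accordingly, the $\mathbf{\Psi}_{l,j}$'s are independent for different values of $l$, which results
in (a). Also, (b) follows from Lemma 4 and the union bound inequality.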

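Let us define the $1 \times \min(p,q)$ vectors $\boldsymbol{\chi}(\mathbf{H}) \triangleq [\gamma_{\min(p,m)}(\mathbf{H}), \gamma_{\min(p,m)-1}(\mathbf{H}), \ldots, \gamma_{1+\min(p,m)-\min(p,q)}(\mathbf{H})]$ and $\boldsymbol{\chi}(\mathbf{G}) \triangleq
[\gamma_{\min(p,n)}(\mathbf{G}), \gamma_{\min(p,n)-1}(\mathbf{G}), \ldots, \gamma_{1+\min(p,n)-\min(p,q)}(\mathbf{G})]$, respectively. Notice that these vectors include the log-values of
the corresponding $\min(p, q)$ smallest eigenvalues of $\mathbf{H}\mathbf{H}^H$ and $\mathbf{G}\mathbf{G}^H$, respectively. Now, applying the result of Lemma 3 to
(15), we can upper-bound the outage probability of the end-to-end channel as

$$\begin{aligned}
\mathbb{P}\{\mathcal{O}\} &\,\dot{\le}\, \mathbb{P}\left\{\sum_{l=1}^{L}\sum_{j=1}^{\min(p,q)} \left(1 - \gamma_j(\mathbf{G}) - \gamma_j(\mathbf{H}) - \psi(l)\right)^+ < Lr\right\} \\
&\overset{(a)}{\le} \mathbb{P}\left\{\sum_{l=1}^{L}\sum_{i=1}^{\min(p,q)} \left(1 - \chi_i(\mathbf{G}) - \chi_i(\mathbf{H}) - \psi(l)\right)^+ < Lr\right\},
\end{aligned} \tag{23}$$

where (a) follows from the fact that the log-values ($\gamma_i$'s) corresponding to the smallest eigenvalues of $\mathbf{H}\mathbf{H}^H$ and $\mathbf{G}\mathbf{G}^H$ are
greater than the log-values corresponding to the largest eigenvalues of these matrices. According to (22), to upper-bound the
outage probability, it is sufficient to upper-bound the probability of the region of $(\boldsymbol{\psi}, \boldsymbol{\chi}(\mathbf{H}), \boldsymbol{\chi}(\mathbf{G}))$ that satisfies (23). The
following lemma gives a general formula for computing such an upper-bound:

Lemma 5 Consider a fixed region $\mathcal{R} \subseteq [0,\infty)^n$. Assume that a uniformly continuous^6 non-negative function $f(\mathbf{x})$ ($f(\mathbf{x}) \ge 0$)
is defined over $[0,\infty)^n$ such that for all $\mathbf{x} \in [0,\infty)^n$ we have $\mathbb{P}\{\mathbf{y} \ge \mathbf{x}\} \,\dot{\le}\, P^{-f(\mathbf{x})}$. Then, we have

$$\mathbb{P}\{\mathbf{y} \in \mathcal{R}\} \,\dot{\le}\, P^{-\min_{\mathbf{x} \in \mathcal{R}} f(\mathbf{x})}. \tag{24}$$

^6 A uniformly continuous function $f: \mathcal{M} \to \mathcal{N}$, where $\mathcal{M} \subseteq \mathbb{R}^m$ and $\mathcal{N} \subseteq \mathbb{R}^n$, is a function that has the following property: for every $\epsilon$, there exists a constant
$g(\epsilon) > 0$ such that for all $\mathbf{x}, \mathbf{y} \in \mathcal{M}$ with $\|\mathbf{x} - \mathbf{y}\| < g(\epsilon)$, we have $\|f(\mathbf{x}) - f(\mathbf{y})\| < \epsilon$.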



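Proof: See Appendix V. ■

According to the upper-bound in (22) and the distribution of $\boldsymbol{\chi}(\mathbf{G})$, $\boldsymbol{\chi}(\mathbf{H})$ derived in [16], we have

$$\begin{aligned}
\mathbb{P}\{\boldsymbol{\psi} \ge \boldsymbol{\psi}_0, \boldsymbol{\chi}(\mathbf{G}) \ge \boldsymbol{\chi}', \boldsymbol{\chi}(\mathbf{H}) \ge \boldsymbol{\chi}''\} &\,\dot{\le}\, P^{-\sum_{l=1}^{L}\psi_0(l) - \sum_{i=1}^{\min(p,q)} \left(a_i\chi''_i + b_i\chi'_i\right)} \\
&\overset{(a)}{\le} P^{-\sum_{l=1}^{L}\psi_0(l) - \sum_{i=1}^{\min(p,q)} (2i - 1 + |p-q|)(\chi'_i + \chi''_i)},
\end{aligned} \tag{25}$$

where $a_i = 2[i + \min(p, m) - \min(p, q)] - 1 + |p - m|$ and $b_i = 2[i + \min(p, n) - \min(p, q)] - 1 + |p - n|$, and (a) follows
from the fact that

$$\begin{aligned}
a_i &= 2[i + \min(p, m) - \min(p, q)] - 1 + |p - m| \\
&= 2i - 1 + m + p - 2\min(p, q) \\
&\ge 2i - 1 + q + p - 2\min(p, q) \\
&= 2i - 1 + |p - q|,
\end{aligned} \tag{26}$$

and similarly, $b_i \ge 2i - 1 + |p - q|$. Now, we can apply the result of Lemma 5 to the region defined in (23) and the upper-bound
derived in (25). Accordingly, we have

$$\mathbb{P}\{\mathcal{O}\} \,\dot{\le}\, P^{-\min_{(\boldsymbol{\chi}(\mathbf{G}),\boldsymbol{\chi}(\mathbf{H}),\boldsymbol{\psi}) \in \mathcal{R}} \left[\sum_{l=1}^{L}\psi(l) + \sum_{i=1}^{\min(p,q)} (2i - 1 + |p-q|)(\chi_i(\mathbf{G}) + \chi_i(\mathbf{H}))\right]}, \tag{27}$$

where the region $\mathcal{R}$ is defined as

$$\mathcal{R} \triangleq \left\{(\boldsymbol{\chi}(\mathbf{G}), \boldsymbol{\chi}(\mathbf{H}), \boldsymbol{\psi}) \;\middle|\; \begin{array}{l} \boldsymbol{\psi} \ge \mathbf{0},\ \chi_1(\mathbf{G}) \ge \cdots \ge \chi_{\min(p,q)}(\mathbf{G}) \ge 0,\ \chi_1(\mathbf{H}) \ge \cdots \ge \chi_{\min(p,q)}(\mathbf{H}) \ge 0, \\ \sum_{l=1}^{L}\sum_{i=1}^{\min(p,q)} \left(1 - \chi_i(\mathbf{G}) - \chi_i(\mathbf{H}) - \psi(l)\right)^+ < Lr \end{array}\right\}. \tag{28}$$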



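Let us assume $L \ge \min(p, q)\sum_{i=1}^{\min(p,q)} \left(2i - 1 + |p - q|\right) = \min^2(p, q)\max(p, q)$. We define the $\min(p, q) \times 1$ vector $\boldsymbol{\varphi} =
[\varphi_1, \cdots, \varphi_{\min(p,q)}]^T$ as $\varphi_i \triangleq \chi_i(\mathbf{G}) + \chi_i(\mathbf{H}) + \frac{1}{L}\sum_{l=1}^{L}\psi(l)$. For each $(\boldsymbol{\chi}(\mathbf{G}), \boldsymbol{\chi}(\mathbf{H}), \boldsymbol{\psi}) \in \mathcal{R}$, we have

$$\begin{aligned}
Lr &> \sum_{l=1}^{L}\sum_{i=1}^{\min(p,q)} \left(1 - \chi_i(\mathbf{G}) - \chi_i(\mathbf{H}) - \psi(l)\right)^+ \\
&= \sum_{i=1}^{\min(p,q)}\sum_{l=1}^{L} \max\left\{0, 1 - \chi_i(\mathbf{G}) - \chi_i(\mathbf{H}) - \psi(l)\right\} \\
&\ge \sum_{i=1}^{\min(p,q)} \max\left\{0, \sum_{l=1}^{L}\left[1 - \chi_i(\mathbf{G}) - \chi_i(\mathbf{H}) - \psi(l)\right]\right\} \\
&= L\sum_{i=1}^{\min(p,q)} \left(1 - \varphi_i\right)^+.
\end{aligned} \tag{29}$$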
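The threshold on L used in this step can be verified by evaluating the sum in closed form. The derivation below is a worked check, assuming the condition on L reads as min(p,q) times the sum of the coefficients 2i - 1 + |p - q|:

```latex
\[
  \sum_{i=1}^{\min(p,q)} \bigl(2i - 1 + |p-q|\bigr)
  = \min{}^{2}(p,q) + \min(p,q)\,|p-q|
  = \min(p,q)\bigl(\min(p,q) + |p-q|\bigr)
  = \min(p,q)\max(p,q) = pq,
\]
% since the sum of the first k odd numbers is k^2. Multiplying by min(p,q):
\[
  \min(p,q)\sum_{i=1}^{\min(p,q)} \bigl(2i - 1 + |p-q|\bigr)
  = \min{}^{2}(p,q)\max(p,q).
\]
```

Note that the sum of the coefficients equals pq, which is the maximum diversity gain of the equivalent p x q point-to-point channel.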

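On the other hand, according to (27) we have

$$\begin{aligned}
\mathbb{P}\{\mathcal{O}\} &\,\dot{\le}\, P^{-\min_{(\boldsymbol{\chi}(\mathbf{G}),\boldsymbol{\chi}(\mathbf{H}),\boldsymbol{\psi}) \in \mathcal{R}} \left[\sum_{l=1}^{L}\psi(l) + \sum_{i=1}^{\min(p,q)} (2i - 1 + |p-q|)(\chi_i(\mathbf{G}) + \chi_i(\mathbf{H}))\right]} \\
&\le P^{-\min_{(\boldsymbol{\chi}(\mathbf{G}),\boldsymbol{\chi}(\mathbf{H}),\boldsymbol{\psi}) \in \mathcal{R}} \sum_{i=1}^{\min(p,q)} (2i - 1 + |p-q|)\varphi_i} \\
&\le P^{-\min_{\boldsymbol{\varphi} \in \hat{\mathcal{R}}} \sum_{i=1}^{\min(p,q)} (2i - 1 + |p-q|)\varphi_i},
\end{aligned} \tag{30}$$

where the first inequality after (27) uses the assumption on $L$, the second one uses (29), and $\hat{\mathcal{R}}$ is defined as

$$\hat{\mathcal{R}} \triangleq \left\{\boldsymbol{\varphi} \;\middle|\; \varphi_1 \ge \cdots \ge \varphi_{\min(p,q)} \ge 0,\ \sum_{i=1}^{\min(p,q)} (1 - \varphi_i)^+ < r\right\}, \tag{31}$$

noting that according to the definition of $\boldsymbol{\varphi}$ we can easily conclude that $\varphi_1 \ge \cdots \ge \varphi_{\min(p,q)} \ge 0$. According to [16], (30)
defines the probability of outage from the rate $r\log(P)$ in an equivalent $p \times q$ point-to-point multi-antenna Rayleigh fading
channel. Hence, we have

$$d_{RS}(r) \ge d_{p \times q}(r). \tag{32}$$

On the other hand, due to the cut-set bound theorem [21], we know that the DMT of the system is upper-bounded by the
minimum of the DMTs of the equivalent point-to-point $p \times m$ and $n \times p$ multi-antenna channels. Hence,

$$d_{RS}(r) \le d_{opt}(r) = d_{p \times q}(r). \tag{33}$$

Comparing (32) and (33) completes the proof. ■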
The statement of Theorem 1 can be generalized to multi-hop networks. However, in a general multi-hop network, the RS
scheme does not necessarily achieve the optimum DMT for any number of antennas at the network nodes. The following
theorem gives a sufficient condition for the RS scheme to achieve the optimal DMT in a multi-hop network:

Theorem 2 Consider a multi-antenna multi-hop network consisting of a single source and destination and full-duplex relays,
with exactly one relay in each hop. Assume that each relay is connected to the relays in the previous and next hop.
Moreover, assume that for a fixed $1 \le m \le h$, we have $\max(N_m, N_{m-1}) \le \min(N_0, N_1, \ldots, N_{m-2}, N_{m+1}, \ldots, N_h)$,
where $h$ denotes the number of hops and $N_i$ denotes the number of antennas at the relay in the $i$'th hop ($N_0$ and $N_h$
denote the numbers of antennas at the source and destination, respectively). Provided $L$ is large enough such that $L \ge
\min^2(N_m, N_{m-1})\max(N_m, N_{m-1})$, the RS scheme achieves the optimum DMT, which is the piecewise-linear function
connecting the points $(k, (N_m - k)(N_{m-1} - k))$, $k = 0, 1, \ldots, \min(N_m, N_{m-1})$.
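The sufficient condition of Theorem 2 is easy to check for a given antenna configuration. The following plain-Python sketch uses hypothetical function names and assumes the condition is the non-strict inequality max(N_m, N_{m-1}) <= min of the remaining antenna counts; it returns the bottleneck hop and the corner points of the resulting DMT.

```python
def find_bottleneck_hop(antennas):
    """antennas = [N_0, ..., N_h]; return a hop index m meeting the assumed
    condition of Theorem 2, or None if no hop qualifies."""
    h = len(antennas) - 1
    for m in range(1, h + 1):
        pair_max = max(antennas[m], antennas[m - 1])
        others = [a for i, a in enumerate(antennas) if i not in (m - 1, m)]
        if all(pair_max <= a for a in others):
            return m
    return None

def rs_dmt_corner_points(antennas):
    """Corner points (k, (N_m - k)(N_{m-1} - k)) when the condition holds."""
    m = find_bottleneck_hop(antennas)
    if m is None:
        return None   # Theorem 2 makes no claim for this configuration
    n_a, n_b = antennas[m], antennas[m - 1]
    return [(k, (n_a - k) * (n_b - k)) for k in range(min(n_a, n_b) + 1)]

# Three-hop example: source with 4 antennas, relays with 2 and 2, destination with 3.
print(find_bottleneck_hop([4, 2, 2, 3]))
print(rs_dmt_corner_points([4, 2, 2, 3]))
```

For a two-hop network the list is [m, p, n], and the corner points reduce to those of Theorem 1.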

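Proof: Using the same argument as in Theorem 1, we can show that the probability of outage from the rate $r\log(P)$ is
equal to

$$\mathbb{P}\{\mathcal{O}\} \doteq \mathbb{P}\left\{\sum_{l=1}^{L}\sum_{i=1}^{N_{\min}} \left(1 - \gamma_i(\mathbf{A}_l)\right)^+ < Lr\right\}, \tag{34}$$

where $N_{\min} \triangleq \min\{N_m, N_{m-1}\}$, $\mathbf{A}_l = \mathbf{G}_h\mathbf{\Theta}_{l,h-1}\mathbf{G}_{h-1}\cdots\mathbf{\Theta}_{l,1}\mathbf{G}_1$, and $\mathbf{G}_i$ denotes the channel matrix between the nodes
of the $i$'th hop and the $(i-1)$'th hop. On the other hand, applying the argument of Lemma 3, we have

$$\gamma_i(\mathbf{A}_l) \le \gamma_i(\mathbf{G}_h) + \gamma_{\min}(\mathbf{\Psi}_{l,i,h-1}) + \gamma_i(\mathbf{G}_{h-1}) + \cdots + \gamma_{\min}(\mathbf{\Psi}_{l,i,1}) + \gamma_i(\mathbf{G}_1), \tag{35}$$

where $\mathbf{\Psi}_{l,i,j} \triangleq \mathbf{V}^H_{(1,i)}(\mathbf{G}_{j+1})\mathbf{\Theta}_{l,j}\mathbf{U}_{(1,i)}(\mathbf{G}_j\mathbf{\Theta}_{l,j-1}\mathbf{G}_{j-1}\cdots\mathbf{\Theta}_{l,1}\mathbf{G}_1)$. Moreover, we can easily check that the statement of
Lemma 4 is still valid. Hence, similar to the proof of Theorem 1, we define the $L \times 1$ vector $\boldsymbol{\psi} \triangleq [\psi(1), \cdots, \psi(L)]^T$ as
$\psi(l) = \max_{i,j}\gamma_{\min}(\mathbf{\Psi}_{l,i,j})$. We have

$$\begin{aligned}
\mathbb{P}\{\boldsymbol{\psi} \ge \boldsymbol{\psi}_0\} &\overset{(a)}{=} \prod_{l=1}^{L} \mathbb{P}\{\psi(l) \ge \psi_0(l)\} \\
&= \prod_{l=1}^{L} \mathbb{P}\left\{\bigcup_{i=1}^{N_{\min}}\bigcup_{j=1}^{h-1} \left(\gamma_{\min}(\mathbf{\Psi}_{l,i,j}) \ge \psi_0(l)\right)\right\} \\
&\overset{(b)}{\dot{\le}} P^{-\sum_{l=1}^{L}\psi_0(l)}.
\end{aligned} \tag{36}$$

As the $\mathbf{\Theta}_{l,j}$'s are independent isotropic unitary matrices, their products with any possibly correlated set of unitary matrices
construct a set of independent isotropic unitary matrices [20]. Accordingly, the $\mathbf{\Psi}_{l,i,j}$'s are independent for different values of
$l, j$, which results in (a). Also, (b) follows from Lemma 4 and the union bound inequality.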

Accordingly, applying dSST l to ( l34b and rewriting inequality series of ( l23T l, we can upper-bound the outage probability as 

P{0}<p|^g" |^l-^^X.(Gj)-VX0j <Lr^, (37) 

where Xii^j) — 7JV„,i„+i-i(Gj), i.e., the reverse ordering of 7i(Gj)'s. Let us define the TVmin x 1 vectors x(Gj)'s as 
x(Gj) = [xi(Gj), X2(Gj), . . . , XAr,„i„(Gj)]"'" containing the corresponding log-values of the N^un smallest eigenvalues of 
GjG^ . Notice that xi(Gj) > X2{Gj) > • • • > XJV„i„(Gj) > 0. According to the upper-bound in (|36] | and the statistical 
behavior of the eigenvalues of GjGj^ derived in [16], we have 

] " (i+min(Afj,Ar,_i)-Af„i„)-l+|Wj-Af3-i|)x.(G,) 

(a) 



'{i'>i',x{G,)>x{G,),j = i,...,h} < p-^^Zf^.'^W-Ej^.M-W 



Here, (a) results from the fact that $2\min(N_j, N_{j-1}) - 2N_{\min} + |N_j - N_{j-1}| = N_j + N_{j-1} - 2N_{\min} \geq N_m + N_{m-1} - 2N_{\min} = 
|N_m - N_{m-1}|$, which comes from the assumption of $\max\{N_m, N_{m-1}\} \leq \min(N_0, N_1, \ldots, N_{m-2}, N_{m+1}, \ldots, N_h)$. Now, we 
can apply the result of Lemma 5 to the region defined in (38) and the upper-bound derived in (37). Accordingly, we have 

P{0}<p-'"i"(x(Gi).....x(G^).v-)e7^ivi-Ef=iV'(O+Er=T"(2»-i^ (39) 



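where the region $\mathcal{R}$ is defined as 

$$\mathcal{R} \triangleq \left\{(\chi(\mathbf{G}_1), \chi(\mathbf{G}_2), \ldots, \chi(\mathbf{G}_h), \psi)\ \middle|\ \psi \geq \mathbf{0},\ \chi_1(\mathbf{G}_j) \geq \chi_2(\mathbf{G}_j) \geq \cdots \geq \chi_{N_{\min}}(\mathbf{G}_j) \geq 0,\ j = 1, 2, \ldots, h,\ \sum_{l=1}^{L}\sum_{i=1}^{N_{\min}}\left(1-\psi(l)-\sum_{j=1}^{h}\chi_i(\mathbf{G}_j)\right)^{+} < Lr\right\}. \tag{40}$$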

Similar to the proof of TheoremlT] we define N^in x 1 vector (p = [ipi, ■ ■ ■ , VAr„i„]^ as ipi = X]j=i Xi(Gj) + X^z^i '/'(O- 
Rewriting the inequality series in ( |29] l and ( l30l l. we can upper-bound the outage probability as 

where 7^ is defined as 

min(p,g) ^ 

TZ^ {ip ipi>---> <^Ar„.„ > 0, ^ (1 - ip,)+ <r\. (42) 
According to [16], (41) and (42) define the probability of outage from the rate $r\log(P)$ in an equivalent $N_m \times N_{m-1}$ 
point-to-point multi-antenna Rayleigh fading channel. Hence, we have 

$$d_{RS}(r) \geq d_{N_m \times N_{m-1}}(r). \tag{43}$$

On the other hand, due to the cut-set bound Theorem [21], we know that the DMT of the system is upper-bounded by the 
minimum of the DMT of the channels of different hops. Hence, 

$$d_{RS}(r) \leq d_{opt}(r) \leq d_{N_m \times N_{m-1}}(r). \tag{44}$$

Comparing (43) and (44) completes the proof. ■ 

Corollary 1 Consider a multi-antenna multi-hop network consisting of a single source, a single destination and full-duplex 
relays with exactly one relay in each hop and assume that all the nodes are equipped with N antennas. Providing L is large 
enough such that L > N^, the RS scheme achieves the optimum DMT, which is the piecewise-linear function connecting the 
points $(k, (N-k)^2)$, $k = 0, 1, \ldots, N$. 
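
The piecewise-linear curve in Corollary 1 can be evaluated directly. The following is a minimal sketch, assuming the standard point-to-point result of [16] that the DMT of an m x n Rayleigh channel connects the points (k, (m-k)(n-k)); the function name `dmt_point_to_point` is a hypothetical helper introduced here for illustration and is not part of the original text.

```python
def dmt_point_to_point(m: int, n: int, r: float) -> float:
    """DMT of an m x n Rayleigh channel: the piecewise-linear curve through
    the points (k, (m - k) * (n - k)) for k = 0, 1, ..., min(m, n)."""
    if r < 0:
        raise ValueError("multiplexing gain r must be non-negative")
    if r >= min(m, n):
        return 0.0
    k = int(r)  # breakpoint at or just below r
    d_left = (m - k) * (n - k)
    d_right = (m - k - 1) * (n - k - 1)
    # Linear interpolation between the two neighbouring corner points.
    return d_left + (r - k) * (d_right - d_left)


# Corner points (k, (N - k)^2) for N = 3 antennas at every node.
corner_points = [(k, dmt_point_to_point(3, 3, k)) for k in range(4)]
```

With N = 3 the corner points are (0, 9), (1, 4), (2, 1), and (3, 0), matching the form stated in the corollary.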

B. Parallel Relay Network 

In this subsection, we consider the setup of a multi-antenna parallel relay network. Specifically, we consider a two-hop 
network consisting of K > 1 half-duplex relays with the assumption that there is no direct link between the source and the 
destination. The source and the destination are denoted by nodes 0 and K + 1, respectively, while the K parallel relays are 
denoted by the nodes 1, 2, ..., K. The optimum DMT of the single-antenna parallel relay network was derived earlier in [1] 
and independently in [18]. Indeed, it is shown in [1] that the RS scheme can achieve the optimum DMT of the single-antenna 
multiple-access parallel relay network. However, much less is known regarding the DMT of multi-antenna parallel relay 
networks. 

Here, we show that the RS scheme achieves a better DMT with respect to the traditional AF relaying and also with respect 
to the other results reported in the literature. Moreover, we show that the RS scheme achieves the optimum DMT of the 2-relay 
parallel relay network. 

Theorem 3 Consider a multi-antenna parallel relay network consisting of a source equipped with m antennas, a destination 
equipped with n antennas and K half-duplex relays each equipped with p antennas. Assume that there exists no direct link 
between the source and the destination.* For any fixed B > min^ (p, q) max(p, q), the RS scheme** with L = BK 
paths, S = BK + 1 slots, the path sequence 

$$Q \equiv (q_1, \ldots, q_K, q_1, \ldots, q_K, \ldots, q_1, \ldots, q_K),$$

in which $q_k \triangleq (0, k, K+1)$, and the timing sequence $s_{i,j} = i + j - 1$, achieves the diversity gain 

$$d_{RS}(r) \geq K d_{p \times q}\left(\left(1+\frac{1}{BK}\right)r\right), \tag{45}$$

where $q \triangleq \min(m, n)$ and $d_{p \times q}(r)$ denotes the diversity gain of the point-to-point $p \times q$ multi-antenna Rayleigh fading channel 
corresponding to the rate $r\log(P)$. Moreover, as $B \to \infty$, the RS scheme achieves the diversity gain $K d_{p \times q}(r)$. 

*Note that in this theorem, the relays are not assumed to be isolated from each other, i.e., there may exist some links between the relays. 
**The reader is encouraged to have a look at the definition of RS scheme and its parameters in [1]. 
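
The path and timing sequences in Theorem 3 can be made concrete with a small sketch. It assumes, as the path sequence Q suggests, that path i is routed through relay (i - 1) mod K + 1 and that hop j of path i occupies slot i + j - 1; the helper names `rs_schedule` and `respects_half_duplex` are hypothetical and introduced only for illustration.

```python
def rs_schedule(K: int, B: int):
    """For each of the L = B*K paths, return the path (source, relay,
    destination) and the slots of its two hops, s_{i,j} = i + j - 1."""
    if K < 2:
        raise ValueError("this setup assumes K > 1 relays")
    L = B * K
    schedule = []
    for i in range(1, L + 1):
        relay = (i - 1) % K + 1      # path i uses q_k with k = (i-1) mod K + 1
        path = (0, relay, K + 1)     # q_k = (0, k, K+1)
        slots = (i, i + 1)           # s_{i,1} = i, s_{i,2} = i + 1
        schedule.append((path, slots))
    return schedule


def respects_half_duplex(schedule) -> bool:
    """Check that no relay is scheduled to receive and transmit in one slot."""
    receiving = {(path[1], slots[0]) for path, slots in schedule}
    transmitting = {(path[1], slots[1]) for path, slots in schedule}
    return receiving.isdisjoint(transmitting)


example = rs_schedule(K=2, B=2)
```

For K = 2 and B = 2 the four paths alternate between the two relays and occupy BK + 1 = 5 slots in total, so each relay listens in one slot and forwards in the next.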

Proof: The proof steps are the same as the ones presented for Theorem 3 of [1] and Theorem 6.7 of [18]. Let us denote 
the channel between the $k$'th relay and the source and the channel between the $k$'th relay and the destination by $\mathbf{H}_k$ and $\mathbf{G}_k$, 
respectively. Moreover, let us define $r(i) \triangleq (i-1) \bmod K + 1$. Similar to the proof of Theorem 3 in [1], one can easily 
show that the end-to-end channel from the source to the destination can be represented by a block lower-triangular matrix. More 
precisely, we have 

$$\mathbf{y} = \mathbf{F}\mathbf{x} + \mathbf{Q}\mathbf{n}_r + \mathbf{n}_d. \tag{46}$$

Here, $\mathbf{x}$ denotes the vector corresponding to all the paths transmitted by the source, and $\mathbf{y}$ denotes the vector corresponding to all 
the paths received by the destination. 



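The matrix $\mathbf{F}$ is given by 

$$\mathbf{F} = \begin{pmatrix} \mathbf{F}_{1,1} & & & \\ \mathbf{F}_{2,1} & \mathbf{F}_{2,2} & & \\ \vdots & \vdots & \ddots & \\ \mathbf{F}_{L,1} & \mathbf{F}_{L,2} & \cdots & \mathbf{F}_{L,L} \end{pmatrix}, \tag{47}$$

where$^{10}$ $\mathbf{F}_{l,l} = \mathbf{G}_{r(l)}\alpha_l\Theta_l\mathbf{H}_{r(l)}$, and 

$$\mathbf{Q} = \begin{pmatrix} \mathbf{Q}_{1,1} & & & \\ \mathbf{Q}_{2,1} & \mathbf{Q}_{2,2} & & \\ \vdots & \vdots & \ddots & \\ \mathbf{Q}_{L,1} & \mathbf{Q}_{L,2} & \cdots & \mathbf{Q}_{L,L} \end{pmatrix}, \tag{48}$$

where $\mathbf{Q}_{l,l} = \mathbf{G}_{r(l)}\alpha_l\Theta_l$. The DMT of the end-to-end channel is equal to 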

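$$d_{RS}(r) = -\lim_{P\to\infty}\frac{\log\left(P\left\{I(\mathbf{x};\mathbf{y}) < (L+1)r\log(P)\right\}\right)}{\log(P)} = -\lim_{P\to\infty}\frac{\log\left(P\left\{\log\left|\mathbf{I}_{Ln} + P\mathbf{F}\mathbf{F}^H\left(\mathbf{I}_{Ln}+\mathbf{Q}\mathbf{Q}^H\right)^{-1}\right| < (L+1)r\log(P)\right\}\right)}{\log(P)}. \tag{49}$$

Noting $\mathbf{F}$ is block lower-triangular and applying Theorem 3.3 in [18], we have 

$$\left|\mathbf{I}_{Ln} + P\mathbf{F}\mathbf{F}^H\left(\mathbf{I}_{Ln}+\mathbf{Q}\mathbf{Q}^H\right)^{-1}\right| \geq \prod_{l=1}^{L}\left|\mathbf{I}_n + P\mathbf{F}_{l,l}\mathbf{F}_{l,l}^H\left(\mathbf{I}_n+\mathbf{Q}_{l,l}\mathbf{Q}_{l,l}^H\right)^{-1}\right|. \tag{50}$$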



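Note that according to the constraint in the RS scheme, we have $\alpha_l \leq 1$. Hence, one can apply the same argument as in 
Lemma 2 and show that 

$$P\left\{\prod_{l=1}^{L}\left|\mathbf{I}_n + P\mathbf{F}_{l,l}\mathbf{F}_{l,l}^H\left(\mathbf{I}_n+\mathbf{Q}_{l,l}\mathbf{Q}_{l,l}^H\right)^{-1}\right| < P^{(L+1)r}\right\} \doteq P\left\{\prod_{l=1}^{L}\left|\mathbf{I}_n + P\mathbf{F}_{l,l}\mathbf{F}_{l,l}^H\right| < P^{(L+1)r}\right\}. \tag{51}$$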



$^{10}$As only the value of $\mathbf{F}_{l,l}$ is needed in the proof of Theorem 3, we just give the value of $\mathbf{F}_{l,l}$ here. The formula for the off-diagonal blocks of $\mathbf{F}$ is much more complicated 
and hence is omitted. The same applies to the blocks of $\mathbf{Q}$ defined right after this. 
Fig. 2. A schematic of the MIMO parallel 2-relay network with no direct link between the source and the destination and also between the relays. 

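Moreover, using the argument in the proof of Theorem 1, one can show that with probability one, we have $\alpha_l = 1$. Hence, 
defining $\mathbf{A}_{k,b} \triangleq \mathbf{G}_k\Theta_{(b-1)K+k}\mathbf{H}_k$, we have 

$$P\left\{\prod_{l=1}^{L}\left|\mathbf{I}_n + P\mathbf{F}_{l,l}\mathbf{F}_{l,l}^H\right| < P^{(L+1)r}\right\} = P\left\{\prod_{k=1}^{K}\prod_{b=1}^{B}\left|\mathbf{I}_n + P\mathbf{A}_{k,b}\mathbf{A}_{k,b}^H\right| < P^{(L+1)r}\right\}. \tag{52}$$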



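Let us define the $1 \times K$ random vector $\sigma \triangleq [\sigma_1, \sigma_2, \ldots, \sigma_K]$ where $\sigma_k \triangleq \frac{\sum_{b=1}^{B}\log\left|\mathbf{I}_n + P\mathbf{A}_{k,b}\mathbf{A}_{k,b}^H\right|}{B\log(P)}$. Notice that $\sigma_k$'s are 
independent of each other. As B > min^(p, q) max(p, q), we can apply Theorem 1 to $\sigma_k$'s. Hence, for any fixed $1 \times K$ vector 
$\sigma^0 \geq \mathbf{0}$, we have 

$$P\{\sigma \leq \sigma^0\} \overset{(a)}{\doteq} P^{-\sum_{k=1}^{K} d_{p\times q}(\sigma^0_k)}. \tag{53}$$

Here, (a) results from Theorem 1 and the fact that $\sigma_k$'s are independent of each other. Denoting the outage event as $\mathcal{O}$, 
according to (49), (50), (51), and (52), we have 

$$P\{\mathcal{O}\} \doteq P\left\{\sum_{k=1}^{K}\sigma_k < \left(K+\frac{1}{B}\right)r\right\}. \tag{54}$$

Let us define the region $\mathcal{R} \triangleq \left\{\sigma \geq \mathbf{0},\ \sum_{k=1}^{K}\sigma_k \leq \left(K+\frac{1}{B}\right)r\right\}$. We have 

$$P\{\mathcal{O}\} \leq P\{\mathcal{R}\} \overset{(a)}{\dot{\leq}} P^{-\min_{\sigma\in\mathcal{R}}\sum_{k=1}^{K} d_{p\times q}(\sigma_k)} \overset{(b)}{\leq} P^{-K d_{p\times q}\left(\left(1+\frac{1}{BK}\right)r\right)}. \tag{55}$$

Here, (a) results from Lemma 5, and (b) results from the fact that $d_{p\times q}(r)$ is a convex decreasing function and, as a result, we 
have $\frac{1}{K}\sum_{k=1}^{K} d_{p\times q}(\sigma_k) \geq d_{p\times q}\left(\frac{1}{K}\sum_{k=1}^{K}\sigma_k\right) \geq d_{p\times q}\left(\left(1+\frac{1}{BK}\right)r\right)$. (55) completes the proof of the Theorem. 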
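
To see how the finite-B loss in the bound behaves, the following sketch evaluates the lower bound of Theorem 3, assuming it has the form K d_{p x q}((1 + 1/(BK)) r) as in (45) and (55), and assuming the standard piecewise-linear point-to-point DMT of [16]. The helper names are hypothetical, and the function does not check the theorem's condition on B.

```python
def dmt_point_to_point(m: int, n: int, r: float) -> float:
    """Piecewise-linear DMT through (k, (m-k)(n-k)), k = 0..min(m, n)."""
    if r < 0:
        raise ValueError("multiplexing gain r must be non-negative")
    if r >= min(m, n):
        return 0.0
    k = int(r)
    d_left = (m - k) * (n - k)
    d_right = (m - k - 1) * (n - k - 1)
    return d_left + (r - k) * (d_right - d_left)


def rs_parallel_bound(K: int, p: int, q: int, B: int, r: float) -> float:
    """Assumed form of the bound: K * d_{p x q}((1 + 1/(B*K)) * r)."""
    return K * dmt_point_to_point(p, q, (1.0 + 1.0 / (B * K)) * r)


# K = 2 relays, p = q = 3: the bound approaches K * d_{3x3}(r) as B grows.
bound_small_B = rs_parallel_bound(2, 3, 3, 10, 1.0)
bound_large_B = rs_parallel_bound(2, 3, 3, 1000, 1.0)
```

The rate penalty comes from using BK + 1 slots to send BK paths, so it vanishes as B grows and the bound tends to K d_{p x q}(r).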



In the following Theorem, we show that the RS scheme achieves the optimum DMT for the two-relay half-duplex parallel 
relay network in which m = n. The two parallel relays may have different numbers of antennas, but there is no direct link 
between them. Figure 2 shows the schematic of such a network. 

Theorem 4 Consider a multi-antenna parallel relay network consisting of a source and a destination each equipped with m 
antennas, and K = 2 half-duplex relays equipped with $n_k$, k = 1, 2 antennas. Assume that there exists no direct link between 
the source and the destination and also between the two relays. Consider the RS scheme with L = BK, S = BK + 1, and 
the path and timing sequences defined in Theorem 3. As $B \to \infty$, the RS scheme achieves the optimum DMT of the network. 
Proof: First, notice that according to the argument of Theorem 3, as $B \to \infty$, the RS scheme achieves the DMT 
$d_{RS,\infty}(r) = \min_{0\leq\nu\leq 2r} d_{m\times n_1}(\nu) + d_{m\times n_2}(2r-\nu)$. Now, to prove the Theorem, we just have to show that $d_{RS,\infty}(r)$ is 
indeed an upper-bound for the optimum DMT. According to the cut-set Theorem [21], we have an upper-bound for the capacity 
of the network for each channel realization. Hence, we can apply the cut-set Theorem to find an upper-bound for the optimum 
DMT. In general, for any arbitrary half-duplex relay network with K relays and any set $\{0\} \subseteq S \subseteq \{0, 1, \ldots, K\}$, we say the 
network is in the state $S$ if the network nodes in $S$ are transmitting and the network nodes in $S^c = \{0, 1, \ldots, K+1\} \setminus S$ are 
receiving. Notice that as the source is always transmitting and the destination is always receiving, we have $0 \in S$, $K+1 \in S^c$. 
Accordingly, we define a $1 \times 2^K$ state vector $\rho$ such that for any set $\{0\} \subseteq S \subseteq \{0, 1, \ldots, K\}$, $\rho_S$ shows the portion of time 
that the half-duplex relay network spends in the state $S$ ($\sum_{\{0\}\subseteq S\subseteq\{0,1,\ldots,K\}}\rho_S = 1$). As the channels are assumed to be fixed 
for the whole transmission period and the relay nodes and the source are assumed to have no channel state knowledge about 
their forward channels, we can assume that a fixed state vector $\rho$ is associated with the strategy that achieves the optimum 
DMT. Denoting the outage event by $\mathcal{O}$, for any general half-duplex relay network consisting of K relays, we have 



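$$\begin{aligned} P\{\mathcal{O}\} &\overset{(a)}{\geq} \min_{\rho} P\left\{\bigcup_{\{0\}\subseteq\mathcal{T}\subseteq\{0,1,\ldots,K\}}\left\{\sum_{\{0\}\subseteq S\subseteq\{0,1,\ldots,K\}}\rho_S\, I\left(X(S\cap\mathcal{T});Y(S^c\cap\mathcal{T}^c)\mid X(S\cap\mathcal{T}^c)\right) < r\log(P)\right\}\right\} \\ &\overset{(b)}{\geq} \min_{\rho}\max_{\{0\}\subseteq\mathcal{T}\subseteq\{0,1,\ldots,K\}} P\left\{\sum_{\{0\}\subseteq S\subseteq\{0,1,\ldots,K\}}\rho_S\, I\left(X(S\cap\mathcal{T});Y(S^c\cap\mathcal{T}^c)\mid X(S\cap\mathcal{T}^c)\right) < r\log(P)\right\}. \end{aligned} \tag{56}$$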

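Here, (a) follows from the cut-set bound Theorem [21] and (b) follows from the union bound on the probability. Now, 
in our two-relay parallel setup, let us define two sets $\mathcal{T}_1 = \{0, 1\}$ and $\mathcal{T}_2 = \{0, 2\}$ corresponding to the two cut-sets. 
Moreover, let us define two events $\mathcal{O}_1$ and $\mathcal{O}_2$ as $\mathcal{O}_1 \triangleq \{\log|\mathbf{I}_m + P\mathbf{G}_1\mathbf{G}_1^H| < \hat{\nu}\log(P),\ \log|\mathbf{I}_{n_2} + P\mathbf{H}_2\mathbf{H}_2^H| < (2r-\hat{\nu})\log(P)\}$ 
and $\mathcal{O}_2 \triangleq \{\log|\mathbf{I}_{n_1} + P\mathbf{H}_1\mathbf{H}_1^H| < \hat{\nu}\log(P),\ \log|\mathbf{I}_m + P\mathbf{G}_2\mathbf{G}_2^H| < (2r-\hat{\nu})\log(P)\}$ where $\hat{\nu} \triangleq \arg\min_{0\leq\nu\leq 2r} d_{m\times n_1}(\nu) + d_{m\times n_2}(2r-\nu)$. 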



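Hence, in our setup, (56) can be simplified as 

$$\begin{aligned} P\{\mathcal{O}\} &\overset{(a)}{\geq} \min_{\rho}\max\Big\{ P\Big\{\sum_{\{0\}\subseteq S\subseteq\{0,1,2\}}\rho_S\, I\left(X(S\cap\mathcal{T}_1);Y(S^c\cap\mathcal{T}_1^c)\mid X(S\cap\mathcal{T}_1^c)\right) < r\log(P)\Big\}, \\ &\qquad\qquad\quad P\Big\{\sum_{\{0\}\subseteq S\subseteq\{0,1,2\}}\rho_S\, I\left(X(S\cap\mathcal{T}_2);Y(S^c\cap\mathcal{T}_2^c)\mid X(S\cap\mathcal{T}_2^c)\right) < r\log(P)\Big\}\Big\} \\ &\geq \min_{\rho}\max\Big\{ P\big\{(\rho_{\{0,1\}}+\rho_{\{0,1,2\}})\log|\mathbf{I}_m+P\mathbf{G}_1\mathbf{G}_1^H| + (\rho_{\{0\}}+\rho_{\{0,1\}})\log|\mathbf{I}_{n_2}+P\mathbf{H}_2\mathbf{H}_2^H| < r\log(P)\big\}, \\ &\qquad\qquad\quad P\big\{(\rho_{\{0,2\}}+\rho_{\{0,1,2\}})\log|\mathbf{I}_m+P\mathbf{G}_2\mathbf{G}_2^H| + (\rho_{\{0\}}+\rho_{\{0,2\}})\log|\mathbf{I}_{n_1}+P\mathbf{H}_1\mathbf{H}_1^H| < r\log(P)\big\}\Big\} \\ &\overset{(b)}{\geq} \min_{\rho}\max\Big\{ \mathbb{1}\big[r-(\rho_{\{0,1,2\}}+\rho_{\{0,1\}})\hat{\nu}-(\rho_{\{0,1\}}+\rho_{\{0\}})(2r-\hat{\nu})\big]P\{\mathcal{O}_1\}, \\ &\qquad\qquad\quad \mathbb{1}\big[r-(\rho_{\{0\}}+\rho_{\{0,2\}})\hat{\nu}-(\rho_{\{0,2\}}+\rho_{\{0,1,2\}})(2r-\hat{\nu})\big]P\{\mathcal{O}_2\}\Big\} \\ &\overset{(c)}{\doteq} P^{-d_{RS,\infty}(r)}, \end{aligned} \tag{57}$$

where $\mathbb{1}[x] = 1$ for $x \geq 0$ and is 0 otherwise. Here, (a) results from taking the maximization of the right-hand side of (56) over 
$\mathcal{T}_1, \mathcal{T}_2$. (b) results from the facts that i) conditioned on $\mathcal{O}_1$ and assuming $r \geq (\rho_{\{0,1,2\}}+\rho_{\{0,1\}})\hat{\nu}+(\rho_{\{0,1\}}+\rho_{\{0\}})(2r-\hat{\nu})$, 
we have $(\rho_{\{0,1\}}+\rho_{\{0,1,2\}})\log|\mathbf{I}_m+P\mathbf{G}_1\mathbf{G}_1^H| + (\rho_{\{0\}}+\rho_{\{0,1\}})\log|\mathbf{I}_{n_2}+P\mathbf{H}_2\mathbf{H}_2^H| < r\log(P)$; and ii) conditioned on $\mathcal{O}_2$ 
and assuming $r \geq (\rho_{\{0\}}+\rho_{\{0,2\}})\hat{\nu}+(\rho_{\{0,2\}}+\rho_{\{0,1,2\}})(2r-\hat{\nu})$, we conclude that $(\rho_{\{0,2\}}+\rho_{\{0,1,2\}})\log|\mathbf{I}_m+P\mathbf{G}_2\mathbf{G}_2^H| + 
(\rho_{\{0\}}+\rho_{\{0,2\}})\log|\mathbf{I}_{n_1}+P\mathbf{H}_1\mathbf{H}_1^H| < r\log(P)$. (c) results from i) $P\{\mathcal{O}_1\} = P\{\mathcal{O}_2\} \doteq P^{-d_{m\times n_1}(\hat{\nu})-d_{m\times n_2}(2r-\hat{\nu})} = P^{-d_{RS,\infty}(r)}$ 
and ii) $\rho_{\{0\}}+\rho_{\{0,1\}}+\rho_{\{0,2\}}+\rho_{\{0,1,2\}} = 1$ (due to the definition of $\rho$), which results in having $r-(\rho_{\{0,1,2\}}+\rho_{\{0,1\}})\hat{\nu}-
(\rho_{\{0,1\}}+\rho_{\{0\}})(2r-\hat{\nu}) = -\left[r-(\rho_{\{0\}}+\rho_{\{0,2\}})\hat{\nu}-(\rho_{\{0,2\}}+\rho_{\{0,1,2\}})(2r-\hat{\nu})\right]$ and consequently, 

$$\mathbb{1}\big[r-(\rho_{\{0,1,2\}}+\rho_{\{0,1\}})\hat{\nu}-(\rho_{\{0,1\}}+\rho_{\{0\}})(2r-\hat{\nu})\big] + \mathbb{1}\big[r-(\rho_{\{0\}}+\rho_{\{0,2\}})\hat{\nu}-(\rho_{\{0,2\}}+\rho_{\{0,1,2\}})(2r-\hat{\nu})\big] \geq 1.$$

(57) completes the proof of the theorem. ■ 

Fig. 3. Parallel relay network with K = 2 relays, each node with 3 antennas and no direct link between source and destination. Horizontal axis: multiplexing gain r. 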
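
The key step in the proof of Theorem 4 is that the two bracketed quantities inside the indicator functions are negatives of each other whenever the state fractions sum to one, so at least one indicator equals one. A minimal numeric check is sketched below; the state vector, the values of r and nu, and the helper names are hypothetical choices made only for illustration.

```python
def cut_margins(rho, r, nu):
    """Return the two bracketed terms of the indicator functions for the
    cut-sets T1 = {0, 1} and T2 = {0, 2}. `rho` maps each state (a tuple of
    transmitting nodes) to the fraction of time spent in that state."""
    a = rho[(0, 1, 2)] + rho[(0, 1)]
    b = rho[(0, 1)] + rho[(0,)]
    c = rho[(0,)] + rho[(0, 2)]
    d = rho[(0, 2)] + rho[(0, 1, 2)]
    margin_1 = r - a * nu - b * (2 * r - nu)
    margin_2 = r - c * nu - d * (2 * r - nu)
    return margin_1, margin_2


def indicator(x: float) -> int:
    """1 for x >= 0 and 0 otherwise."""
    return 1 if x >= 0 else 0


# Hypothetical state vector; the fractions sum to one.
rho_example = {(0,): 0.1, (0, 1): 0.4, (0, 2): 0.3, (0, 1, 2): 0.2}
m1, m2 = cut_margins(rho_example, r=1.0, nu=0.7)
```

Because a + c = 1 and b + d = 1, the two margins always sum to zero, which is why the maximum over the two cut-sets retains at least one of the outage events regardless of the state vector.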
Figure 3 shows the DMT of various schemes for the parallel relay network with K = 2 relays and m = n = p = 3, i.e., 
3 antennas at each node. As shown in Theorem 4, the RS scheme achieves the optimum DMT. However, if we do not 
apply random unitary matrix multiplication at the relay nodes, applying the steps in the proof of Theorem 3, one can easily 
show that the RS scheme achieves the DMT of $K d_{\mathbf{GH}}(r)$, where $d_{\mathbf{GH}}(r)$ denotes the DMT of the product of the channel 
matrix $\mathbf{H}$ from the source to the relay and the channel matrix $\mathbf{G}$ from the relay to the destination (see (4)). Finally, applying 
the NAF scheme of [2], [17], one can easily show that the DMT $K d_{\mathbf{GH}}(2r)$ is achievable. 
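
The optimum DMT of the two-relay network in Theorem 4 is a one-dimensional minimization over the split of the rate between the two relays. The sketch below evaluates it, assuming the standard piecewise-linear point-to-point DMT of [16]; since the objective is convex and piecewise linear, it is enough to test the endpoints and breakpoints. The function names are hypothetical.

```python
def dmt_point_to_point(m: int, n: int, r: float) -> float:
    """Piecewise-linear DMT through (k, (m-k)(n-k)), k = 0..min(m, n)."""
    if r < 0:
        raise ValueError("multiplexing gain r must be non-negative")
    if r >= min(m, n):
        return 0.0
    k = int(r)
    d_left = (m - k) * (n - k)
    d_right = (m - k - 1) * (n - k - 1)
    return d_left + (r - k) * (d_right - d_left)


def dmt_two_relay(m: int, n1: int, n2: int, r: float) -> float:
    """min over 0 <= nu <= 2r of d_{m x n1}(nu) + d_{m x n2}(2r - nu)."""
    total = 2.0 * r
    candidates = {0.0, total}
    for k in range(min(m, n1) + 1):       # breakpoints of the first term
        if 0 <= k <= total:
            candidates.add(float(k))
    for k in range(min(m, n2) + 1):       # breakpoints of the second term
        nu = total - k
        if 0 <= nu <= total:
            candidates.add(nu)
    return min(dmt_point_to_point(m, n1, nu) + dmt_point_to_point(m, n2, total - nu)
               for nu in candidates)


# Setup of Figure 3: m = n1 = n2 = 3.
dmt_values = [dmt_two_relay(3, 3, 3, r) for r in (0.0, 1.0, 1.5)]
```

When both relays have the same number of antennas the minimum is attained at an even split, so the result coincides with K d_{p x q}(r) for K = 2.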

C. Multiple-Antenna Single Relay Channel 

In this subsection, we consider the most-studied scenario in relay networks, the single relay setup, in which a direct link 
exists between the source and the destination. The relay is assumed to be half-duplex. There has been extensive research on 
this particular setup toward characterization of the DMT. The authors of [2] have shown that the NAF scheme achieves the 
best DMT among all possible AF relaying schemes for the Single-Input Single-Output (SISO) single half-duplex relay channel. 
However, here we show that using independent uniformly random unitary matrices across different time-slots improves the 
DMT of the NAF scheme for the multi-antenna setup. In order to exploit the potential benefit from random unitary matrix 
multiplication, the source transmits in 2B consecutive time-slots. In the odd time-slots, the relay listens to the source signal. 
In the even time-slots, the relay multiplies the signal received in the previous time-slot by a uniformly random unitary matrix 
and then amplifies the result with the maximum possible coefficient, which is less than or equal to 1. The destination decodes 
the transmitted message based on the joint decoding of the signal it receives in the 2B time-slots. 



Theorem 5 Consider a single relay channel consisting of a source, a half-duplex relay, and a destination equipped with m, p, 
and n antennas, respectively. Let us consider a modified NAF scheme that benefits from the random unitary matrix multiplication 
at the relay node and the joint decoding at the destination side through 2B time-slots. Assuming B > min^ (p, q) max(p, q) 
where q = min(m, n), the modified NAF scheme achieves the following DMT 

$$d_{MNAF}(r) \geq d_{m\times n}(r) + d_{p\times q}(2r). \tag{58}$$



Proof: The proof is similar to the proof of Theorem 3. Indeed, assuming the source-destination, source-relay, and relay-destination 
channel matrices are denoted by $\mathbf{F}$, $\mathbf{H}$, and $\mathbf{G}$, respectively, we can show that the end-to-end channel matrix is 
equal to 

$$\begin{pmatrix} \mathbf{F} & & & & \\ \alpha\mathbf{G}\Theta_1\mathbf{H} & \mathbf{F} & & & \\ & & \ddots & & \\ & & & \mathbf{F} & \\ & & & \alpha\mathbf{G}\Theta_B\mathbf{H} & \mathbf{F} \end{pmatrix}. \tag{59}$$

Here, we observe that the lower-diagonal elements are independent of the diagonal elements. Hence, we can apply Theorem 
3.3 in [18]. Accordingly, the DMT corresponding to the end-to-end system is greater than or equal to the summation of the 
DMT of the forward channel $\mathbf{F}$ and the DMT of a two-hop channel utilized in half of the time. In other words, $d_{MNAF}(r) \geq 
d_{m\times n}(r) + d_{p\times q}(2r)$. Details of the proof are similar to the proof of Theorem 3. ■ 
Figure 4 compares the achievable DMT of the NAF scheme with the achievable DMT of the modified NAF scheme for the 
single relay channel with m = n = p = 3 and m = n = p = 4 antennas. Reference [13] has shown that the NAF protocol 
achieves the DMT $d_{NAF}(r) \geq d_{m\times n}(r) + d_{\mathbf{GH}}(2r)$. As we observe, the modified NAF scheme outperforms the NAF scheme 
for small values of r. 
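
The lower bound for the modified NAF scheme can be tabulated for the antenna configurations of Figure 4. This is a sketch that assumes the bound has the form d_{m x n}(r) + d_{p x q}(2r) stated in the proof of Theorem 5 and the standard point-to-point DMT of [16]; the DMT of the product channel needed for the plain NAF curve is not given in this section, so only the modified scheme is evaluated. Function names are hypothetical.

```python
def dmt_point_to_point(m: int, n: int, r: float) -> float:
    """Piecewise-linear DMT through (k, (m-k)(n-k)), k = 0..min(m, n)."""
    if r < 0:
        raise ValueError("multiplexing gain r must be non-negative")
    if r >= min(m, n):
        return 0.0
    k = int(r)
    d_left = (m - k) * (n - k)
    d_right = (m - k - 1) * (n - k - 1)
    return d_left + (r - k) * (d_right - d_left)


def modified_naf_bound(m: int, n: int, p: int, r: float) -> float:
    """Assumed form of the bound: d_{m x n}(r) + d_{p x q}(2r), q = min(m, n)."""
    q = min(m, n)
    return dmt_point_to_point(m, n, r) + dmt_point_to_point(p, q, 2.0 * r)


# Three antennas at every node, as in panel i) of Figure 4.
mnaf_curve = [modified_naf_bound(3, 3, 3, r) for r in (0.0, 0.5, 1.5, 2.0)]
```

The relay term vanishes once 2r reaches min(p, q), after which only the direct-link term d_{m x n}(r) contributes.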



D. General Full-Duplex Relay Networks 

Here, we generalize the statement of Remark 6 in [1] and Theorem 4.2 in [18] to the multi-antenna case. Indeed, it is 
shown in [1] that the RS scheme achieves a linear DMT connecting the points $(0, d_{\max})$ and $(r_{\max}, 0)$, where $d_{\max}$ denotes 
the maximum diversity and $r_{\max}$ denotes the maximum multiplexing-gain (which is 1) for single-antenna full-duplex relay 
networks whose underlying graph is directed acyclic.$^{11}$ However, in the multi-antenna setup, [1] could only show that the RS 
scheme achieves the maximum diversity gain. To generalize the statement, we have to review the definitions from [1]. 

$^{11}$A directed graph is called directed acyclic if it contains no directed cycle [22]. 

Fig. 4. DMT of the NAF scheme versus the modified NAF scheme for the multiple-antenna single relay channel with i) 3 antennas, ii) 4 antennas at each node. 

Definition 1 A general multi-relay network with the connectivity graph G = (V, E) is a wireless network with the vertex set 
V = {0, 1, ..., K + 1} where 0 and K + 1 represent the source and the destination, respectively, and the other nodes represent 
the relays. Each pair of vertices that belong to E are connected together through a quasi-static Rayleigh-fading channel and 
the pairs of vertices that do not lie in E are disconnected from each other. The number of antennas at node i is denoted by 
$N_i$. 

Definition 2 For a network with the connectivity graph G = (V, E), a cut-set on G is defined as a subset $S \subseteq V$ such that 
$0 \in S$, $K+1 \in S^c$. The weight of the cut-set corresponding to $S$, denoted by $w_G(S)$, is defined as 

$$w_G(S) = \sum_{a\in S,\ b\in S^c,\ (a,b)\in E} N_a N_b. \tag{60}$$

Theorem 6 Consider a full-duplex multi-antenna multi-relay network with the graph G = (V, E) where G is directed acyclic. 
Assume each node has N antennas. The RS scheme achieves the following DMT 

$$d_{RS}(r) = \frac{\min_{S} w_G(S)}{N^2}\, d_{N\times N}(r), \tag{61}$$

where $d_{N\times N}(r)$ denotes the DMT of an $N \times N$ multi-antenna channel. 

Proof: Using the same path sequence as the one in the proof of Remark 6 in [1] and applying the result of Corollary 
1, (61) can be derived. The steps are the same as the steps in the proof of Theorem 5, noting the equivalent point-to-point 
channel is block lower-triangular and the function $d_{N\times N}(r)$ is convex decreasing. ■ 
Remark 1- According to (61), a specific RS scheme with fixed path and timing sequences can simultaneously achieve the 
maximum diversity gain, which is $\min_{S} w_G(S)$ (refer to [1]), and the maximum multiplexing gain, which is $N$ in a multi-antenna 
relay network whose underlying graph is directed acyclic. In other words, the RS scheme is robust in the sense that 
it achieves the corner-points of the optimum DMT with no modification of the scheme parameters. 

Fig. 5. An example of a multi-antenna directed acyclic network with full-duplex relays, each node equipped with 2 antennas. 

Figure 5 shows an example of a directed acyclic network. The relays are operating in the full-duplex mode and each node is 
equipped with two antennas. Here, the weight of the minimum cut-set depicted in the figure is 8. Hence, applying the argument 
of Theorem 6, the RS scheme achieves $d_{RS}(r) = 2d_{2\times 2}(r)$. However, the DMT upper-bound is equal to $d_{ub}(r) = d_{2\times 4}(r)$, 
which is obtained from the same cut-set. Although the two DMTs are equal at the corner-points, they do not coincide in 
between. 
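
The cut-set weight of Definition 2 and the resulting RS curve can be computed by brute force on small graphs. The sketch below assumes the weight is the sum of N_a N_b over edges leaving the cut and that the RS curve scales d_{N x N}(r) by the minimum weight divided by N^2, which is consistent with the example above. The diamond-shaped graph is a hypothetical network, not the one in Figure 5, and all function names are introduced here for illustration.

```python
from itertools import combinations


def dmt_point_to_point(m: int, n: int, r: float) -> float:
    """Piecewise-linear DMT through (k, (m-k)(n-k)), k = 0..min(m, n)."""
    if r < 0:
        raise ValueError("multiplexing gain r must be non-negative")
    if r >= min(m, n):
        return 0.0
    k = int(r)
    d_left = (m - k) * (n - k)
    d_right = (m - k - 1) * (n - k - 1)
    return d_left + (r - k) * (d_right - d_left)


def min_cut_weight(num_relays: int, edges, antennas) -> int:
    """Minimum over cut-sets S (source in S, destination not in S) of the
    sum of N_a * N_b over directed edges (a, b) with a in S and b not in S."""
    source, relays = 0, range(1, num_relays + 1)
    best = None
    for size in range(num_relays + 1):
        for subset in combinations(relays, size):
            S = {source, *subset}
            weight = sum(antennas[a] * antennas[b]
                         for a, b in edges if a in S and b not in S)
            best = weight if best is None else min(best, weight)
    return best


def rs_general_bound(min_weight: int, N: int, r: float) -> float:
    """Assumed form of the RS curve: (min cut weight / N^2) * d_{N x N}(r)."""
    return (min_weight / N ** 2) * dmt_point_to_point(N, N, r)


# Hypothetical diamond network: source 0, relays 1 and 2, destination 3.
edges = [(0, 1), (0, 2), (1, 3), (2, 3)]
antennas = {0: 2, 1: 2, 2: 2, 3: 2}
w_min = min_cut_weight(2, edges, antennas)
```

In this hypothetical network every cut-set has weight 8, so the RS curve is 2 d_{2 x 2}(r), the same scaling as in the example of Figure 5. Brute-force enumeration is exponential in the number of relays and is meant only for small illustrations.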

IV. Conclusion 

In this paper, we derived new DMT results in various setups of multi-antenna relay networks using AF relaying. For this 
purpose, the application of the RS scheme proposed in [1] was studied. It was shown that random unitary matrix multiplication 
at the relay nodes enables the RS scheme to achieve a better DMT in comparison to the traditional AF relaying. First, the 
multi-antenna full-duplex single-relay two-hop network was studied for which the RS scheme was shown to achieve the 
optimum DMT. This result was also generalized to the multi-antenna multi-hop full-duplex relay network with one relay in 
each hop. Next, applying this result, a new achievable DMT was derived for multi-antenna half-duplex parallel relay networks. 
Interestingly, it turned out that the DMT of the RS scheme is optimum for the multi-antenna half-duplex parallel two-relay 
setup with no direct link between the relays. Moreover, using random unitary matrix multiplication was shown to improve 
the DMT of the NAF scheme of [2] in the setup of the widely used multi-antenna single relay channel. Finally, the general 
full-duplex multi-antenna relay network was studied and a new lower-bound was obtained on the DMT using the RS scheme, 
assuming that the underlying network graph is directed acyclic. 

Appendix I 
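Proof of Lemma 1 

Assuming a multiplexing gain of r, the diversity of the original system can be written as 

$$d(r) = -\lim_{P\to\infty}\frac{\log\left(P\left\{\log\left|\mathbf{I} + \alpha^2 P\,\mathbf{G}\mathbf{H}\mathbf{H}^H\mathbf{G}^H\left(\mathbf{I}+\alpha^2\mathbf{G}\mathbf{G}^H\right)^{-1}\right| < r\log(P)\right\}\right)}{\log(P)}. \tag{62}$$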
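Since $\alpha^2\mathbf{G}\mathbf{G}^H \succeq \mathbf{0}$, it follows that 
$\left|\mathbf{I} + \alpha^2 P\,\mathbf{G}\mathbf{H}\mathbf{H}^H\mathbf{G}^H\left(\mathbf{I}+\alpha^2\mathbf{G}\mathbf{G}^H\right)^{-1}\right| \leq \left|\mathbf{I} + \alpha^2 P\,\mathbf{G}\mathbf{H}\mathbf{H}^H\mathbf{G}^H\right|$. This implies that 

$$d(r) \leq d_u(r) \triangleq -\lim_{P\to\infty}\frac{\log\left(P\left\{\log\left|\mathbf{I} + \alpha^2 P\,\mathbf{G}\mathbf{H}\mathbf{H}^H\mathbf{G}^H\right| < r\log(P)\right\}\right)}{\log(P)}, \tag{63}$$

which is the DMT of the system in (2). Moreover, selecting $\alpha = \min\left\{1, \sqrt{\frac{P}{E\{\|\mathbf{H}\mathbf{x}+\mathbf{n}\|^2\}}}\right\}$ (which guarantees that the output 
power of the relay remains below $P$), we have 

$$\alpha^2\mathbf{G}\mathbf{G}^H \preceq \|\mathbf{G}\|^2\,\mathbf{I}. \tag{64}$$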

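Defining the event $\mathcal{C} \triangleq \{\|\mathbf{G}\|^2 > c\log(P)\}$, and noting that $\|\mathbf{G}\|^2$ is a Chi-square random variable with 2pn degrees of 
freedom, we have 

$$P\{\mathcal{C}\} = P^{-c}\sum_{k=0}^{pn-1}\frac{(c\log(P))^k}{k!} \leq d\,(\log(P))^{pn-1}P^{-c} \tag{65}$$

for sufficiently large $P$, where $d$ is a constant independent of $P$. Defining the outage event of the system in (1) as $\mathcal{O}$, we have 

$$\begin{aligned} P\{\mathcal{O}\} &= P\{\mathcal{O}\mid\mathcal{C}\}P\{\mathcal{C}\} + P\{\mathcal{O}\mid\mathcal{C}^c\}P\{\mathcal{C}^c\} \\ &\leq P\{\mathcal{C}\} + P\{\mathcal{O}\mid\mathcal{C}^c\}P\{\mathcal{C}^c\}. \end{aligned} \tag{66}$$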
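
The closed-form tail in (65) can be sanity-checked numerically. The sketch below assumes that the squared norm of a matrix with pn independent unit-variance complex Gaussian entries follows a Gamma distribution with shape pn and unit scale, whose tail at x is exp(-x) times the sum of x^k / k! for k < pn; with x = c log(P) the factor exp(-x) equals P^{-c}. The function names, the seed, and the sample size are arbitrary illustrative choices.

```python
import math
import random


def gamma_tail_closed_form(shape: int, x: float) -> float:
    """P{X > x} for X ~ Gamma(shape, 1) with integer shape."""
    return math.exp(-x) * sum(x ** k / math.factorial(k) for k in range(shape))


def norm_tail_probability(pn: int, c: float, P: float) -> float:
    """P{||G||^2 > c log(P)} when ||G||^2 ~ Gamma(pn, 1)."""
    return gamma_tail_closed_form(pn, c * math.log(P))


def gamma_tail_monte_carlo(shape: int, x: float, trials: int, seed: int) -> float:
    """Empirical estimate of the same tail probability."""
    rng = random.Random(seed)
    hits = sum(1 for _ in range(trials) if rng.gammavariate(shape, 1.0) > x)
    return hits / trials


closed = gamma_tail_closed_form(4, 6.0)          # p = n = 2, threshold x = 6
estimate = gamma_tail_monte_carlo(4, 6.0, trials=100_000, seed=1)
```

For large P the polynomial factor in log(P) is negligible on the exponential scale, which is why the event contributes P^{-c} to the exponent analysis.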

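Conditioned on $\mathcal{C}^c$, the probability of the outage event can be upper-bounded as 

$$P\{\mathcal{O}\mid\mathcal{C}^c\} \leq P\left\{\log\left|\mathbf{I} + \frac{\alpha^2 P}{c\log(P)+1}\,\mathbf{G}\mathbf{H}\mathbf{H}^H\mathbf{G}^H\right| < r\log(P)\ \middle|\ \mathcal{C}^c\right\}, \tag{67}$$

which is equal to the probability of the outage event of the system in (3), denoted as $\mathcal{O}_1$, conditioned on $\mathcal{C}^c$. In other words, 
$P\{\mathcal{O}\mid\mathcal{C}^c\} \leq P\{\mathcal{O}_1\mid\mathcal{C}^c\}$. Substituting in (66) yields 

$$\begin{aligned} P\{\mathcal{O}\} &\leq P\{\mathcal{C}\} + P\{\mathcal{O}_1\mid\mathcal{C}^c\}P\{\mathcal{C}^c\} \\ &\leq P\{\mathcal{C}\} + P\{\mathcal{O}_1\}. \end{aligned} \tag{68}$$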



Since the capacity of the two-hop network is no larger than the capacity of either the source-relay link or the relay-destination link, it 
follows that the outage event $\mathcal{O}$ includes $\mathcal{O}_{sr}$, the event of outage in the source-relay link, and $\mathcal{O}_{rd}$, the event of outage in 
the relay-destination link. As a result, 

$$P\{\mathcal{O}\} \geq \max\{P\{\mathcal{O}_{sr}\}, P\{\mathcal{O}_{rd}\}\} \overset{(a)}{\dot{\geq}} \max\{P^{-mp}, P^{-pn}\} \geq P^{-pn}. \tag{69}$$

(a) results from the fact that the outage probability corresponding to the multiplexing gain r is greater than or equal to the 
outage probability corresponding to the multiplexing gain 0. Setting c = 2pn - 1, from (65), it is concluded that 

$$P\{\mathcal{C}\} \dot{\leq} \frac{P\{\mathcal{O}\}}{P^{pn-1}}. \tag{70}$$

From the above equation and (68), it follows that 

$$P\{\mathcal{O}\} \dot{\leq} P\{\mathcal{O}_1\}, \tag{71}$$

which implies that the DMT of the original system is lower-bounded by the DMT of the system in (3). This completes the 
proof of Lemma 1. 

Note that as we are looking for a lower-bound for the DMT, it is fine to select any arbitrary value for $\alpha$. 

Appendix II 
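Proof of Lemma 2 

Defining $P' \triangleq \frac{P}{c\log(P)+1}$, the DMT of the system in (3) can be written as 

$$\begin{aligned} d_1(r) &= -\lim_{P\to\infty}\frac{\log\left(P\left\{\log\left|\mathbf{I} + \alpha^2 P'\,\mathbf{G}\mathbf{H}\mathbf{H}^H\mathbf{G}^H\right| < r\log(P)\right\}\right)}{\log(P)} \\ &\overset{(a)}{=} -\lim_{P'\to\infty}\frac{\log\left(P\left\{\log\left|\mathbf{I} + \alpha^2 P'\,\mathbf{G}\mathbf{H}\mathbf{H}^H\mathbf{G}^H\right| < r'\log(P')\right\}\right)}{\log(P')} \\ &\overset{(b)}{=} d_u(r') \\ &\overset{(c)}{=} d_u(r), \end{aligned} \tag{72}$$

where $r' \triangleq r\frac{\log(P)}{\log(P')}$, and (a) follows from the fact that as $P' = \frac{P}{c\log(P)+1}$, we have $P = P'\left[c\log(P') + O(\log\log(P'))\right]$, 
which implies that $\lim_{P\to\infty}\frac{\log(P')}{\log(P)} = 1$. In other words, in the first line of the preceding equation, one can substitute $\log(P)$ 
by $\log(P')$. (b) results from the fact that the second line in the right hand side of the preceding equation is exactly the DMT of 
the system in (2) at $r'$, which is denoted by $d_u(r')$. Finally, (c) follows from the facts that $\lim_{P\to\infty}\frac{r'}{r} = 1$ and the continuity 
of the DMT curve, which implies that $d_u(r') = d_u(r)$. This completes the proof of Lemma 2. 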
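
Step (a) in the proof above relies on log(P')/log(P) tending to one. A short numeric sketch makes the rate of convergence visible; it works directly with log(P) to avoid overflow, and the function name and the value c = 7 (i.e., 2pn - 1 with p = n = 2) are illustrative choices rather than values taken from the text.

```python
import math


def log_ratio(log_p: float, c: float) -> float:
    """Return log(P') / log(P) for P' = P / (c * log(P) + 1)."""
    log_p_prime = log_p - math.log(c * log_p + 1.0)
    return log_p_prime / log_p


# The ratio increases towards 1 as log(P) grows.
ratios = [log_ratio(x, c=7.0) for x in (10.0, 1e3, 1e6)]
```

The gap 1 - log(P')/log(P) equals log(c log(P) + 1)/log(P), so it decays only logarithmically in log(P) but still vanishes in the limit, and the scaled multiplexing gain r' = r log(P)/log(P') converges to r.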

Appendix III 
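Proof of Lemma 3 

First, notice that for values of $i > \min\{m,p\}$ or $i > \min\{p,n\}$, $\lambda_i(\mathbf{G})$ or $\lambda_i(\mathbf{H})$ are defined as zero. Hence, the argument 
of this lemma is obvious in these cases. Now, we prove the argument for $i \leq \min\{m,n,p\}$. According to the Courant-Fischer-Weyl 
Theorem [23], we have 

$$\lambda_i(\mathbf{G}\Theta\mathbf{H}) = \max_{\mathcal{S},\ \dim(\mathcal{S})=i}\ \min_{\mathbf{x}\in\mathcal{S}}\frac{\|\mathbf{G}\Theta\mathbf{H}\mathbf{x}\|^2}{\|\mathbf{x}\|^2}. \tag{73}$$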
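Now, let us define $\mathcal{S}_0 \triangleq \langle\mathbf{v}_1(\mathbf{H}), \mathbf{v}_2(\mathbf{H}), \ldots, \mathbf{v}_i(\mathbf{H})\rangle$ where $\mathbf{v}_j(\mathbf{H})$ denotes the $j$'th column of $\mathbf{V}(\mathbf{H})$ and $\langle\mathbf{a}_1, \mathbf{a}_2, \ldots, \mathbf{a}_k\rangle$ 
denotes the span of $\mathbf{a}_1, \mathbf{a}_2, \ldots, \mathbf{a}_k$. We have 

$$\begin{aligned} \lambda_i(\mathbf{G}\Theta\mathbf{H}) &\geq \min_{\mathbf{x}\in\mathcal{S}_0}\frac{\|\mathbf{G}\Theta\mathbf{H}\mathbf{x}\|^2}{\|\mathbf{x}\|^2} \\ &\overset{(a)}{=} \min_{\mathbf{x}'\in\mathbb{C}^i,\ \|\mathbf{x}'\|^2\geq 1}\left\|\mathbf{G}\Theta\mathbf{H}\mathbf{V}_{(1,i)}(\mathbf{H})\mathbf{x}'\right\|^2 \\ &= \min_{\mathbf{x}'\in\mathbb{C}^i,\ \|\mathbf{x}'\|^2\geq 1}\left\|\mathbf{G}\Theta\mathbf{U}(\mathbf{H})\Lambda^{\frac{1}{2}}(\mathbf{H})\mathbf{I}_{m\times i}\mathbf{x}'\right\|^2 \\ &= \min_{\mathbf{x}'\in\mathbb{C}^i,\ \|\mathbf{x}'\|^2\geq 1}\left\|\mathbf{G}\Theta\mathbf{U}_{(1,i)}(\mathbf{H})\Lambda^{\frac{1}{2}}_{(1,i)}(\mathbf{H})\mathbf{x}'\right\|^2 \\ &\overset{(b)}{\geq} \min_{\mathbf{y}'\in\mathbb{C}^i,\ \|\mathbf{y}'\|^2\geq 1}\lambda_i(\mathbf{H})\left\|\mathbf{G}\Theta\mathbf{U}_{(1,i)}(\mathbf{H})\mathbf{y}'\right\|^2 \\ &\overset{(c)}{=} \min_{\mathbf{y}'\in\mathbb{C}^i,\ \|\mathbf{y}'\|^2\geq 1}\lambda_i(\mathbf{H})\left\|\Lambda^{\frac{1}{2}}(\mathbf{G})\mathbf{V}^H(\mathbf{G})\Theta\mathbf{U}_{(1,i)}(\mathbf{H})\mathbf{y}'\right\|^2 \\ &\overset{(d)}{\geq} \min_{\mathbf{y}'\in\mathbb{C}^i,\ \|\mathbf{y}'\|^2\geq 1}\lambda_i(\mathbf{H})\left\|\Lambda^{\frac{1}{2}}_{(1,i)}(\mathbf{G})\mathbf{V}^H_{(1,i)}(\mathbf{G})\Theta\mathbf{U}_{(1,i)}(\mathbf{H})\mathbf{y}'\right\|^2 \\ &\overset{(e)}{\geq} \min_{\mathbf{y}'\in\mathbb{C}^i,\ \|\mathbf{y}'\|^2\geq 1}\lambda_i(\mathbf{G})\lambda_i(\mathbf{H})\left\|\mathbf{V}^H_{(1,i)}(\mathbf{G})\Theta\mathbf{U}_{(1,i)}(\mathbf{H})\mathbf{y}'\right\|^2 \\ &\overset{(f)}{=} \lambda_i(\mathbf{G})\lambda_i(\mathbf{H})\lambda_{\min}\left(\mathbf{V}^H_{(1,i)}(\mathbf{G})\Theta\mathbf{U}_{(1,i)}(\mathbf{H})\right), \end{aligned} \tag{74}$$

where $\mathbf{I}_{m\times i}$ denotes the $m \times i$ matrix formed by the first $i$ columns of the identity matrix and $\Lambda^{\frac{1}{2}}_{(1,i)}(\mathbf{H})$ denotes the square submatrix of $\Lambda^{\frac{1}{2}}(\mathbf{H})$ consisting of its 
first $i$ rows and first $i$ columns. Here, (a) follows from the fact that i) $\mathbf{x} \in \mathcal{S}_0$ is equivalent to $\mathbf{x} = \mathbf{V}_{(1,i)}(\mathbf{H})\mathbf{x}'$ for some $\mathbf{x}' \in \mathbb{C}^i$; 
and ii) $\|\mathbf{x}\|^2 = \|\mathbf{x}'\|^2$. (b) follows from the fact that for any $\mathbf{x}' \in \mathbb{C}^i$, $\|\mathbf{x}'\|^2 \geq 1$, defining $\mathbf{y}' = \frac{\Lambda^{\frac{1}{2}}_{(1,i)}(\mathbf{H})\mathbf{x}'}{\sqrt{\lambda_i(\mathbf{H})}}$, we have 
$\|\mathbf{y}'\| \geq \|\mathbf{x}'\| \geq 1$. (c) follows from the fact that for any unitary matrix $\mathbf{P}$, we have $\|\mathbf{P}\mathbf{A}\| = \|\mathbf{A}\|$. (d) follows from the 
fact that defining $\mathbf{z}' = \Theta\mathbf{U}_{(1,i)}(\mathbf{H})\mathbf{y}'$, we have 

$$\left\|\Lambda^{\frac{1}{2}}(\mathbf{G})\mathbf{V}^H(\mathbf{G})\mathbf{z}'\right\|^2 = \left\|\Lambda^{\frac{1}{2}}_{(1,i)}(\mathbf{G})\mathbf{V}^H_{(1,i)}(\mathbf{G})\mathbf{z}'\right\|^2 + \left\|\Lambda^{\frac{1}{2}}_{(i+1,p)}(\mathbf{G})\mathbf{V}^H_{(i+1,p)}(\mathbf{G})\mathbf{z}'\right\|^2.$$

(e) follows from the fact that defining $\mathbf{w}' = \mathbf{V}^H_{(1,i)}(\mathbf{G})\Theta\mathbf{U}_{(1,i)}(\mathbf{H})\mathbf{y}'$, we have $\left\|\Lambda^{\frac{1}{2}}_{(1,i)}(\mathbf{G})\mathbf{w}'\right\|^2 \geq \lambda_i(\mathbf{G})\|\mathbf{w}'\|^2$. Finally, (f) 
follows from the Courant-Fischer-Weyl Theorem [23]. (74) completes the proof of the Lemma. 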
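Appendix IV 
Proof of Lemma 4 

We have 

$$\left|\Psi_{l,i}\Psi_{l,i}^H\right| = \prod_{j=1}^{i}\lambda_j(\Psi_{l,i}) \overset{(a)}{\leq} \lambda_{\min}(\Psi_{l,i})\left(\frac{\mathrm{Tr}\left\{\Psi_{l,i}\Psi_{l,i}^H\right\}}{i-1}\right)^{i-1}, \tag{75}$$

where (a) follows from the arithmetic-geometric mean inequality and the fact that $\sum_{j=1}^{i-1}\lambda_j(\Psi_{l,i}) \leq \mathrm{Tr}\left\{\Psi_{l,i}\Psi_{l,i}^H\right\}$. $\mathrm{Tr}\left\{\Psi_{l,i}\Psi_{l,i}^H\right\}$ can be 
upper-bounded as follows: 

$$\begin{aligned} \mathrm{Tr}\left\{\Psi_{l,i}\Psi_{l,i}^H\right\} &= \left\|\mathbf{V}^H_{(1,i)}(\mathbf{G})\Theta_l\mathbf{U}_{(1,i)}(\mathbf{H})\right\|^2 \\ &\overset{(a)}{\leq} \left\|\mathbf{V}_{(1,i)}(\mathbf{G})\right\|^2\left\|\Theta_l\mathbf{U}_{(1,i)}(\mathbf{H})\right\|^2 \\ &\overset{(b)}{=} \left\|\mathbf{V}_{(1,i)}(\mathbf{G})\right\|^2\left\|\mathbf{U}_{(1,i)}(\mathbf{H})\right\|^2 \\ &\overset{(c)}{=} i^2. \end{aligned} \tag{76}$$

In the preceding equation, (a) follows from the fact that $\|\mathbf{A}\mathbf{B}\|^2 \leq \|\mathbf{A}\|^2\|\mathbf{B}\|^2$ for any two matrices $\mathbf{A}$ and $\mathbf{B}$. (b) results 
from the fact that $\|\Theta\mathbf{A}\| = \|\mathbf{A}\|$ for any matrix $\mathbf{A}$ and any unitary matrix $\Theta$. Finally, (c) follows from the fact that as $\mathbf{U}(\mathbf{H})$ 
and $\mathbf{V}(\mathbf{G})$ are unitary matrices, each of their columns has unit norm. Combining (75) and (76) yields 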



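$$\left|\Psi_{l,i}\Psi_{l,i}^H\right| \leq \lambda_{\min}(\Psi_{l,i})\left(\frac{i^2}{i-1}\right)^{i-1}, \tag{77}$$

which implies that 

$$\lambda_{\min}(\Psi_{l,i}) \geq c\left|\Psi_{l,i}\Psi_{l,i}^H\right| \tag{78}$$

for some constant $c$. Denoting the $j$th column of $\Psi_{l,i}$ as $\Psi_{l,i}^{(j)}$, we can write the determinant of $\Psi_{l,i}\Psi_{l,i}^H$ as 

$$\left|\Psi_{l,i}\Psi_{l,i}^H\right| = \prod_{j=1}^{i}\mu_j, \tag{79}$$

where $\mu_j$ denotes the square norm of the projection of $\Psi_{l,i}^{(j)}$ over $\mathcal{Q}_j$, the null-space of the subspace spanned by 
$\left\{\Psi_{l,i}^{(s)}\right\}_{s=1}^{j-1}$. Combining (78) and (79) yields 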



(80) 



(a) 



where $\kappa = \frac{1}{c}$ and (a) follows from the union bound on the probability. $\Psi_{l,i}^{(s)}$ can be considered as the projection of the $s$th 
column of the matrix $\mathbf{R}_l \triangleq \Theta_l\mathbf{U}_{(1,i)}(\mathbf{H})$, denoted by $\mathbf{R}_l^{(s)}$, over the $i$-dimensional subspace spanned by $\{\mathbf{v}_j(\mathbf{G})\}_{j=1}^{i}$, which 
is denoted by $\mathcal{P}$. Now, consider a random unit vector $\mathbf{w}$ in $\mathcal{P}$, which is orthogonal to the first $j-1$ columns of the matrix 
$\mathbf{R}_l$. Since $\left\{\Psi_{l,i}^{(s)}\right\}_{s=1}^{j-1}$ are the projections of the first $j-1$ columns of $\mathbf{R}_l$ over $\mathcal{P}$, it follows that $\mathbf{w}$ is also orthogonal to 
$\left\{\Psi_{l,i}^{(s)}\right\}_{s=1}^{j-1}$. In other words, $\mathbf{w}$ belongs to $\mathcal{Q}_j$, the null-space of the subspace spanned by $\left\{\Psi_{l,i}^{(s)}\right\}_{s=1}^{j-1}$. Hence, we have 



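$$\begin{aligned} \mu_j &= \left\|\left[\Psi_{l,i}^{(j)}\right]_{\mathcal{Q}_j}\right\|^2 \\ &\geq \left|\mathbf{w}^H\Psi_{l,i}^{(j)}\right|^2, \end{aligned} \tag{81}$$

where $[\mathbf{a}]_{\mathcal{T}}$ denotes the projection of the vector $\mathbf{a}$ over the subspace $\mathcal{T}$. The second line in the preceding equation follows 
from the fact that the norm of the projection of a vector over a subspace is greater than or equal to the norm of the projection of that 
vector over any arbitrary unit vector in that subspace. Since $\Psi_{l,i}^{(j)}$ is the projection of $\mathbf{R}_l^{(j)}$ over $\mathcal{P}$, $\mathbf{R}_l^{(j)}$ can be written as 

$$\mathbf{R}_l^{(j)} = \Psi_{l,i}^{(j)} + \Psi_{l,i}^{\perp(j)}, \tag{82}$$

where $\Psi_{l,i}^{\perp(j)}$ denotes the projection of $\mathbf{R}_l^{(j)}$ over $\mathcal{P}^{\perp}$, the null-space of $\mathcal{P}$. Since $\mathbf{w} \in \mathcal{P}$, it follows that $\mathbf{w}^H\Psi_{l,i}^{\perp(j)} = 0$. This 
means that $\mathbf{w}^H\Psi_{l,i}^{(j)} = \mathbf{w}^H\mathbf{R}_l^{(j)}$. Combining the above with (81) yields 

$$\mu_j \geq \left|\mathbf{w}^H\mathbf{R}_l^{(j)}\right|^2. \tag{83}$$

Since $\Theta_l$ and $\mathbf{U}(\mathbf{H})$ are unitary matrices and $\mathbf{U}(\mathbf{H})$ is isotropically distributed [20], it follows that $\Theta_l\mathbf{U}(\mathbf{H})$ is an isotropic 
unitary matrix. This means i) $\left\{\mathbf{R}_l^{(s)}\right\}_{s=1}^{j}$ is an orthonormal set, which implies that $\mathbf{R}_l^{(j)}$ is orthogonal to $\left\{\mathbf{R}_l^{(s)}\right\}_{s=1}^{j-1}$, 
and ii) $\mathbf{R}_l^{(j)}$ is an isotropic unit vector. As a result, $\mathbf{R}_l^{(j)}$ is an isotropic unit vector in the $(p-j+1)$-dimensional subspace 
perpendicular to $\left\{\mathbf{R}_l^{(s)}\right\}_{s=1}^{j-1}$. Noting that $\mathbf{w}$ is also in this subspace and $\mathbf{R}_l^{(j)}$ and $\mathbf{w}$ are independent of each other, from [24], 
Lemma 3, the CDF of $Z \triangleq \left|\mathbf{w}^H\mathbf{R}_l^{(j)}\right|^2$ can be computed as 

$$P\{Z \leq z\} = 1-(1-z)^{p-j}. \tag{84}$$

Combining (80), (83), and (84), it follows that 

$P\{\lambda_{\min}(\Psi_{l,i}) \leq \varepsilon\} \leq$ 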



1 - 1 - 



' KS 



(85) 



where 77 = i{p — ^-ti).^. In the above equation, (a) follows from the fact that (1 — a;)" > 1 — nx, Vn > 0, < a; < 1. This 
completes the proof of Lemma 4. 

Note that as $j \leq i$, the subspace from which $\mathbf{w}$ is chosen has dimension at least 1. 
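Appendix V 
Proof of Lemma 5 

We can assume that $f(\mathbf{x})$ is defined such that $\mathbf{x} \leq \mathbf{y} \Rightarrow f(\mathbf{x}) \leq f(\mathbf{y})$; otherwise, we can redefine $f(\mathbf{x})$ as $f(\mathbf{x}) = 
\inf_{\mathbf{y}\geq\mathbf{x}} f(\mathbf{y})$. Let us define the set $\mathcal{S}(\epsilon)$ for every $\epsilon > 0$ as $\mathcal{S}(\epsilon) = \left\{\mathbf{x} \in [0,\infty)^n \mid d(\mathbf{x},\mathcal{R}) \leq \sqrt{n}\epsilon,\ \frac{\mathbf{x}}{\epsilon} \in \mathbb{Z}^n\right\}$ where $d(\mathbf{x},\mathcal{R}) = 
\inf_{\mathbf{y}\in\mathcal{R}}\|\mathbf{x}-\mathbf{y}\|$. Also, let us define the partial order $\preceq$ for two sets $\mathcal{A}, \mathcal{B} \subseteq [0,\infty)^n$ as follows: $\mathcal{A} \preceq \mathcal{B}$ iff for every 
$\mathbf{x} \in \mathcal{B}$ there exists a $\mathbf{y} \in \mathcal{A}$ such that $\mathbf{y} \leq \mathbf{x}$. One can easily verify that $\mathcal{S}(\epsilon) \preceq \mathcal{R}$. Now, let us define the set $\mathcal{L}(\epsilon)$ as 
$\mathcal{L}(\epsilon) \triangleq \{\mathbf{x} \in \mathcal{S}(\epsilon) \mid \nexists\,\mathbf{y} \in \mathcal{S}(\epsilon),\ \mathbf{y} < \mathbf{x}\}$, i.e., $\mathcal{L}(\epsilon)$ consists of the minimal members of $\mathcal{S}(\epsilon)$. Notice that the elements of $\mathcal{S}(\epsilon)$ 
can be mapped to the elements of $\mathbb{Z}^n$ such that the partial order $\leq$ between the real vectors is kept between the corresponding 
integer vectors. For every subset $\mathcal{A}$ of the nonnegative integer vectors in $\mathbb{Z}^n$ and every $\mathbf{x} \in \mathcal{A}$, there exists a minimal member $\mathbf{y} \in \mathcal{A}$ such that $\mathbf{y} \leq \mathbf{x}$. 
Accordingly, we have such a property for $\mathcal{S}(\epsilon)$. This means $\mathcal{L}(\epsilon) \preceq \mathcal{S}(\epsilon)$. Noticing $\mathcal{S}(\epsilon) \preceq \mathcal{R}$, we conclude that $\mathcal{L}(\epsilon) \preceq \mathcal{R}$. 
On the other hand, it is easy to check that $\mathcal{L}(\epsilon)$ is a finite set, i.e., $|\mathcal{L}(\epsilon)| < \infty$. Hence, we have 

$$P\{\mathcal{R}\} \overset{(a)}{\leq} P\left\{\bigcup_{\mathbf{x}'\in\mathcal{L}(\epsilon)}(\mathbf{x} \geq \mathbf{x}')\right\} \leq |\mathcal{L}(\epsilon)|\max_{\mathbf{x}'\in\mathcal{L}(\epsilon)} P\{\mathbf{x} \geq \mathbf{x}'\} \overset{(b)}{\dot{\leq}} P^{-\min_{\mathbf{x}\in\mathcal{L}(\epsilon)} f(\mathbf{x})}, \tag{86}$$

where (a) follows from $\mathcal{L}(\epsilon) \preceq \mathcal{R}$ and (b) follows from the fact that $|\mathcal{L}(\epsilon)| < \infty$. Now, let us define $h(\epsilon) = \min_{\mathbf{x}\in\mathcal{L}(\epsilon)} f(\mathbf{x})$. 
For every $\epsilon$, we have $h(\epsilon) \leq \inf_{\mathbf{x}\in\mathcal{R}} f(\mathbf{x})$ (otherwise, according to (86), the statement of the Lemma is proved). Now, we prove 
that $\lim_{\epsilon\to 0} h(\epsilon)$ exists and in fact, we have $\lim_{\epsilon\to 0} h(\epsilon) = \inf_{\mathbf{x}\in\mathcal{R}} f(\mathbf{x})$. As $f(\mathbf{x})$ is uniformly continuous, there exists a positive 
function $g(\epsilon)$ ($g(\epsilon) > 0$) such that for all $\mathbf{x}, \mathbf{y}$, $\|\mathbf{x}-\mathbf{y}\| \leq g(\epsilon)$, we have $|f(\mathbf{x})-f(\mathbf{y})| \leq \epsilon$. Consider any positive constant 
$\delta > 0$ and any $\epsilon \leq \frac{g(\delta)}{\sqrt{n}}$. According to the definition, for any $\mathbf{x} \in \mathcal{L}(\epsilon)$, there exists a $\mathbf{y} \in \mathcal{R}$ such that $\|\mathbf{x}-\mathbf{y}\| \leq \sqrt{n}\epsilon \leq g(\delta)$. 
Accordingly, $|f(\mathbf{x})-f(\mathbf{y})| \leq \delta$. Hence, we have $h(\epsilon) = \min_{\mathbf{x}\in\mathcal{L}(\epsilon)} f(\mathbf{x}) \geq \inf_{\mathbf{x}\in\mathcal{R}} f(\mathbf{x}) - \delta$. On the other hand, we know 
$h(\epsilon) \leq \inf_{\mathbf{x}\in\mathcal{R}} f(\mathbf{x})$. This proves $\lim_{\epsilon\to 0} h(\epsilon) = \inf_{\mathbf{x}\in\mathcal{R}} f(\mathbf{x})$. Noticing $\lim_{\epsilon\to 0} h(\epsilon) = \inf_{\mathbf{x}\in\mathcal{R}} f(\mathbf{x})$ and applying (86) proves 
the Lemma. 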